Giter Club home page Giter Club logo

dxwg's Introduction

W3C Dataset Exchange Working Group (DXWG) - DCAT

This repository contains the Dataset Catalogue Vocabulary (DCAT) work of the Dataset Exchange Working Group (DXWG).

DCAT Working Documents

Other DXWG Documents

dxwg's People

Contributors

acka47 avatar agbeltran avatar agreiner avatar aidig avatar aisaac avatar andrea-perego avatar cburle avatar davebrowning avatar dozed avatar dr-shorthair avatar fanieli avatar heidivanparys avatar jakubklimek avatar kcoyle avatar larsgsvensson avatar lvdbrink avatar makxdekkers avatar nicholascar avatar nichtich avatar pchampin avatar plehegar avatar pwin avatar rawmatt avatar riccardoalbertoni avatar rob-metalinkage avatar rubenverborgh avatar swickr avatar xfq avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dxwg's Issues

Related datasets [RRDS]

Related datasets [RRDS]

Ability to represent the different relationships between datasets, including: versions of a dataset, collection of datasets, to describe their inclusion criteria and to define the 'hasPart'/'partOf' relationship, derivation, e.g. processed data that is derived from raw data

this requriement to be rolled in here: Update method of Dataset: Indicate the update method of a Dataset description, e.g. whether each new dataset entirely supercedes previous ones (is stand-alone), or whether there is a base dataset with files that effect updates to that base.


Related requirements:  Provenance information [RPIF] 
Related use cases: Relationships between Datasets [ID32] Define update method [ID47] 

Profile representation [RPFRP]

Profile representation [RPFRP]

Create a way to retrieve more information about a profile. This must be flexible enough to support human and machine readable resources, such as input and editing guidance, validation resources, usage notes etc.

Following additional requirements consider the representation of a profile (document), expressing concrete constraints, compared to the definition of the concept in#RPFDF

  1. Machine-readable specifications of application profiles need to be easily publishable, and optimize re-use of existing specifications.
  2. Application profiles need a rich expression for the validation of metadata
  3. Profiles must have properties for at least two levels of documentation: 1) short definition 2) input and editing guidance
  4. Profiles must support declaration of vocabulary constraints
  5. A mechanism must be available to identify conformance to each inherited profile given conformance to a profile that specialises it.
  6. Profiles list valid vocabulary terms for a metadata usage environment
  7. Profile vocabulary lists may be defined as closed (no other terms are allowed) or open (other terms are allowed)#ID37
  8. Profiles should reuse vocabulary terms defined elsewhere (Dublin Core profiles; no use case)
  9. Profiles must be able to support information that can drive data creation functions, including brief and detailed documentation#ID46
  10. Profiles must support discoverability via search engines#ID40
  11. Profiles must have identifiers that can be used to link the DCAT description to the relevant profile

Related requirements: Profile definition [RPFDF] Profiles listing [RPFL] 
Related use cases: Detailing and requesting additional constraints (profiles) beyond content types [ID2] Vocabulary constraints [ID41] Profile support for input functions [ID46] 

ID37 Europeana profile ecosystem: representing, publishing and consuming application profiles of the Europeana Data Model (EDM)

Discussed at F2F day 2 (July 18, 2017) but not resolved. See minutes:

https://www.w3.org/2017/07/18-dxwg-minutes

Main issue was that of "nesting profiles" - efficient but complex if nesting profiles owned by others; could change in unexpected ways. Also the question of whether profiles can narrow but not expand vocabulary semantics.

Makx reports that DCAT-AP is stand-alone, copying in those elements of DCAT that it will use, and adding others.

Some comments from minutes:
maybe https://en.wikipedia.org/wiki/Composition_over_inheritance is another way to couch the issues?
antoine: relationship between profiles needs to be more flexible/fluid

dsr: More like delegation of responsibilities

phila: ODRL has inheritance but answer is prohibit/allowed with default if indeterminate

a question of use cases for knowing details about profiles in DCAT. A clean model would only require the minimum information for handing over to the profile which would be responsible for handling any sub-profiles.
phila: in this space we would need some conflict resolution
... we'd have to prove it....

alejandra: in most cases profiles would just be 'leaves' - but looking across profiles is still a need

(recording a related if old technology: http://www.rddl.org/ was a design for xml namespaces that made a simple HTML page - a bit like a 'profile' - with pointers to machine formats like DTDs, XSDs etc.)
kcoyle: Do profile definers find that they are re-using existing elements or extending.

Profiles listing [RPFL]

Profiles listing [RPFL]

Create a way to list the profiles implemented by a dataset or a specific distribution

Some subset of profile metadata that can be included in a DCAT context should have a canonical set of properties recommended. Initial requirements are 1) short definition 2) input and editing guidance. Links should be consistent with#RPFRP


Related requirements: Profile definition [RPFDF] Profile negotiation [RPFN] Profile representation [RPFRP] 
Related use cases: Profile support for input functions [ID46] 

Distribution package [RDIP]

Distribution package [RDIP]

Define way to specify content of packaged files in a Distribution. For example, a set of files may be organised in an archive format and then compressed, but dct:hasFormat property only indicates the encoding type of the outer layer of packaging.


Related use cases: DCAT packaged distributions [ID1] 

Dataset aspects [RDSAT]

Dataset aspects [RDSAT]

Provide recommendations and mechanisms for data providers to describe datasets with a formal description of aspects (e.g. instrument/sensor used, spatial feature, observable property, quantity kind).

Finer grained semantics will also allow dataset dimensions to be described, and distributions described using these semantics - for example how a dataset is composed of multiple subsets, such as a set of image bands or tiles, or parameterised filtering/subsetting services

This requirement applies to catalogues of DCAT records, and is thus related to the concept of profiles, which are expected to define classification dimensions (use of controlled vocabularies in mandatory properties)


Related requirements: Profiles listing [RPFL] Distribution schema [RDIS] 
Related use cases: Support associating fine-grained semantics for datasets and resources within a dataset [ID7] Profile support for input functions [ID46] Europeana profile ecosystem: representing, publishing and consuming application profiles of the Europeana Data Model (EDM) [ID37] Summarization/Characterization of datasets [ID33] 

Provenance information [RPIF]

Provenance information [RPIF]

Provide a way to link to structured information about the provenance of a dataset including:

  • the input data used to create a dataset to the dataset.
  • the software used to produce the dataset to the dataset.
  • an extensible model different types of agent roles
  • funders

dct:creator, dct:publisher etc are special cases, which require guidance, further roles may be defined in provenance or other richer models. The requirement is to establish an extensible mechanism, and for profiles to specify canonical equivalents for the special case properties of dcat:Dataset


Related use cases: Common requirements for scientific data [ID9] Modeling data lineage [ID12] Modeling agent roles [ID13] Modeling funding sources [ID31] 

Distribution schema [RDIS]

Distribution schema [RDIS]

Define a way to include identification of the schema the described data conforms to

This may include rich information via extensions points, URI templates and parameters, dimensions and subsetting operations, dereferenceable identifiers of service behaviour profiles and canonical identifiers of well-known web service interfaces (e.g. OGC - WFS, WMS, OpenDAP, REST apis).

Such a description may be provided through identifier of a suitable profile that defines interoperability conditions the distribution conforms to.


Related requirements: Profiles listing [RPFL] Dataset aspects [RDSAT] 
Related use cases: DCAT Distribution to describe web services [ID6] Summarization/Characterization of datasets [ID33] Data access restrictions [ID17] Template link in metadata [ID22] 

Dataset access [RDSA]

Dataset access [RDSA]

Provide a way to specify access restrictions for both a dataset and a distribution.

Provide a way to define the meaning of the access restrictions for a dataset or distribution and to specify what is required to access a dataset and distribution.


Related use cases: Data access restrictions [ID17] 

Dummy issue [RDMY]

Dummy issue [RDMY]

Provide clear guidance on conditions, type and severity of a resource's update that might motivate the creation of a new version in scenarios such as dataset evolution, conversion, translations etc, including how this may assist change management processes for consumers (e.g. semantic versioning techniques)

Test

Distribution service [RDISV]

Distribution service [RDISV]

Ability 1) to describe the type of distribution and 2) provide information about the type of service

Such a description may be provided through a suitable profile identifier that defines a profile of the relevant service type.


Related requirements: Profiles listing [RPFL] Distribution schema [RDIS] 
Related use cases: DCAT Distribution to describe web services [ID6] Modeling service-based data access [ID18] Machine actionable link for a mapping client [ID21] 

Data quality model [RDQM]

Data quality model [RDQM]

Identify common modeling patterns for different aspects of data quality based on frequently referenced data quality attributes found in existing standards and practices.

This includes potential use and revision of DQV

Aspects include:

  • the degree of a dataset's precision (i.e. measure of resolution or variability).
  • the degree of a dataset's accuracy (i.e. measure of correctness).
  • the degree a dataset conforms to a stated quality standard.
  • details of data quality conformance test results.

  • Related use cases: Modeling data precision and accuracy [ID15] Data quality modeling patterns [ID14] Modeling conformance test results on data quality [ID16] Machine actionable link for a mapping client [ID21] Template link in metadata [ID22] Data Quality Vocabulary (DQV) Wish List left by the DWBP WG [ID23] 

    Profile definition [RPFDF]

    Profile definition [RPFDF]

    Create a sufficiently wide definition of an application profile to address declaration of interoperability profiles data may conform to, and through this mechanism provide the means for DCAT instances and collections to also declare the profiles of DCAT they conform to.

    These Use case specific requirements apply to the required scope of this definition. Where appropriate these are also captured as additional requirements.

    1. Clarify any relationship between profiles and validation languages
    2. Profiles have URI identifiers that resolve to more detailed descriptions
    3. Each application profile needs to be documented, preferably by showing/reusing what is common across profiles
    4. Profiles should provide both machine and human readable views.
    5. Profiles may inherit clauses from one or more parent profiles
    6. Profiles must support a view that includes all inherited constraints clearly identifying the parent profile the constraint is defined in.
    7. Profiles may be used to describe the metadata standard a description conforms to, the standards to which the resource described (e.g. dataset) and the standards each distribution conforms to.
    8. Conceptually, profiles can extend other vocabularies or profiles, or can be refinements of other vocabularies or profiles (#ID37)
    9. Responses can conform to multiple, modular profiles (#ID3)


    Related requirements: Profile representation [RPFRP] Profile negotiation [RPFN] Profiles listing [RPFL] 
    Related use cases: Detailing and requesting additional constraints (profiles) beyond content types [ID2] Responses can conform to multiple, modular profiles [ID3] Discover available content profiles [ID5] Description of dataset compliance with standards [ID43] Profile relation to validation [ID48] 

    Dataset type [RDST]

    Dataset type [RDST]

    Provide a mechanism to indicate the type of data being described and recommend vocabularies to use given the dataset type indicated.

    Providing examples of scope will provide guidance, without being unnecessarily restrictive. The key requirement is interoperability, achieved by using standardised vocabulary terms. It it unclear whether a canonical registry is required or whether communities should constrain choice via DCAT profiles.


    Related requirements: Dataset aspects [RDSAT] 
    Related use cases: Scope or type of dataset with a DCAT description [ID8] Modelling resources different from datasets [ID20] 

    Recommend Projects

    • React photo React

      A declarative, efficient, and flexible JavaScript library for building user interfaces.

    • Vue.js photo Vue.js

      🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

    • Typescript photo Typescript

      TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

    • TensorFlow photo TensorFlow

      An Open Source Machine Learning Framework for Everyone

    • Django photo Django

      The Web framework for perfectionists with deadlines.

    • D3 photo D3

      Bring data to life with SVG, Canvas and HTML. 📊📈🎉

    Recommend Topics

    • javascript

      JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

    • web

      Some thing interesting about web. New door for the world.

    • server

      A server is a program made to process requests and deliver data to clients.

    • Machine learning

      Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

    • Game

      Some thing interesting about game, make everyone happy.

    Recommend Org

    • Facebook photo Facebook

      We are working to build community through open source technology. NB: members must have two-factor auth.

    • Microsoft photo Microsoft

      Open source projects and samples from Microsoft.

    • Google photo Google

      Google ❤️ Open Source for everyone.

    • D3 photo D3

      Data-Driven Documents codes.