Introduction
This page describes how the information definition outlined in Complex and simple types can be represented using fields from ISO 19115 and DCAT (DCAT2 and DCAT3). For example, the queryDatasets operation may return DCAT-structured data.
- DCAT2 is used in https://www.europeandataportal.eu/en
- DCAT3 is currently under development. It can be found at: https://www.w3.org/TR/vocab-dcat-3/
Purpose
The representations are created:
- in order to improve the current information definition. For example, it may be possible to bring the terms and definitions more in line with accepted international standards. Once this has happened, the updated terms and definitions can be used to write requirements on dataset catalogues.
- in order to allow implementers using different vocabularies to map to the appropriate term.
Mappings
ISO 19115 versions
The mappings use ISO 19115:2003. More recent versions of the standard are available, e.g. ISO 19115-1:2014. This does not affect the dataset mappings much, as they map to the same fields.
Each table includes:
- the current information definition
- the ISO 19115:2003 field
- the DCAT3 equivalent
- the DCAT2 equivalent
- general notes. This column also contains ideas for change requests.
Dataset
| Type name | Property name | Definition | Type | ISO 19115:2003 | DCAT3 equivalent | DCAT2 equivalent | General notes |
|---|---|---|---|---|---|---|---|
| Dataset | | Metadata about a dataset. | | MD_Metadata describing resource of type DS_DataSet | dcat:Dataset | dcat:Dataset | Change proposal (draft only): remodel dataset, dataset series and DPS to show they are different types of resources. In reality, the metadata about them is very similar. |
| | id | Identification of the dataset (e.g. a URI) | IdentifierType | MD_DataIdentification.citation.CI_Citation.identifier | @id | dct:identifier https://www.w3.org/TR/vocab-dcat-2/#Property:resource_identifier | Change proposal (draft only): rename property name to "identifier". Update definition to "Special unique identifier permanently assigned to an entity." (AIRM) |
| | title | Title of the dataset | string | MD_DataIdentification.citation.CI_Citation.title | dct:title https://www.w3.org/TR/vocab-dcat-3/#Property:resource_title | dct:title https://www.w3.org/TR/vocab-dcat-2/#Property:resource_title | Change proposal (draft only): update definition to "name by which the cited resource is known." (ISO 19115) |
| | shortDescription | Free text providing a short description of the dataset | string | MD_DataIdentification.abstract | dct:description https://www.w3.org/TR/vocab-dcat-3/#Property:resource_description | dct:description https://www.w3.org/TR/vocab-dcat-2/#Property:resource_description | Change proposal (draft only): rename property name to "abstract". Update definition to "brief narrative summary of the content of the resource(s)." (ISO 19115) |
| | type | Indication of the main content of the dataset. | CodeDatasetType | MD_DataIdentification.descriptiveKeywords.MD_Keywords.keyword | dct:type https://www.w3.org/TR/vocab-dcat-3/#Property:resource_type | dct:type https://www.w3.org/TR/vocab-dcat-2/#Property:resource_type | Can this be bundled into an overall classification scheme? |
| | subject | Scope of the dataset expressed as a list of subjects. | CodeDataSubjectType | MD_DataIdentification.descriptiveKeywords.MD_Keywords.keyword | dcat:theme https://www.w3.org/TR/vocab-dcat-3/#Property:resource_theme | dcat:theme (assoc) https://www.w3.org/TR/vocab-dcat-2/#Property:resource_theme The values will need to be represented as URIs. Use the PANS AIM data catalogue terms (from App1-1) as URLs: | The list of possible terms should be registered/published in order to allow URLs to be used. Recommendation is to use a controlled vocabulary. Consider using AIRM identifiers for resources, e.g. "dcat:theme": "urn:aero:airm:1.0.0:ContextualModel:ATMBusinessTerms:Aerodrome" |
| | updateType | Indication of whether the dataset has full content or represents an update. | CodeUpdateType | MD_DataIdentification.descriptiveKeywords.MD_Keywords.keyword | | dct:format https://www.w3.org/TR/vocab-dcat-2/#Property:distribution_format | "Update" is the wrong word. Datasets are not updated. Think of a better word to reflect the temporality model. It is not clear if this is the best DCAT2 field. The semantics do not match easily. We could consider e.g. prov:wasDerivedFrom. "Release Type" makes more sense. A request has been raised to the DCAT3 WG: https://github.com/w3c/dxwg/issues/1289 |
| | isAiracUpdate | Indication that the dataset has been updated according to the AIRAC System. | boolean | MD_DataIdentification.extent.EX_Extent.description Note: this is the same approach as used in DS-META-004 - Indication that this is an AIRAC update | dct:accrualMethod | prov:wasGeneratedBy https://www.w3.org/TR/vocab-dcat-2/#Property:dataset_was_generated_by Uses an instance of prov:Activity with attribute: If the dataset was generated as part of the AIRAC cycle, then the value of this attribute will be “AIRAC”. | The definition of this term makes it clear that it is a statement of the process used to update the data. That is why prov is used. Consider using AIRM identifiers for resources, e.g. "prov:wasGeneratedBy": "urn:aero:airm:1.0.0:ContextualModel:ATMBusinessTerms:AIRAC" |
| | limitationsOnUse | Free text clarifying limitations with regard to the use of the dataset. | string | MD_DataIdentification.resourceConstraints.MD_Constraints.useLimitations Can also use e.g. MD_LegalConstraints, MD_SecurityConstraints. Note: this is the same approach as used in DS-META-005 - Limitations on use. | odrl:hasPolicy | odrl:hasPolicy https://www.w3.org/TR/vocab-dcat-2/#Property:resource_access_rights https://www.w3.org/TR/vocab-dcat-2/#Property:resource_license https://www.w3.org/TR/vocab-dcat-2/#Property:resource_rights Uses an instance of odrl:Prohibition with attribute: Example: “operational use” | Need to check what is meant here; policy is the most generic approach. |
| | encoding | Standard encoding of the data | CodeDataEncodingType | DQ_DataQuality.report.DQ_Element.result.DQ_ConformanceResult.specification | dct:conformsTo https://www.w3.org/TR/vocab-dcat-3/#Property:resource_conforms_to | dct:conformsTo https://www.w3.org/TR/vocab-dcat-2/#Property:resource_conforms_to | ISO mapping is long-winded. Is it correct to treat it this way? |
| | effectiveDate | Start of the validity period for the dataset | date | MD_DataIdentification.extent.EX_Extent.temporalElement.EX_TemporalExtent.extent Note: this is the same approach as used in DS-META-003 - Validity of the data set | dct:temporal.dct:startDate https://www.w3.org/TR/vocab-dcat-3/#Property:dataset_temporal | dct:temporal https://www.w3.org/TR/vocab-dcat-2/#Property:dataset_temporal Uses an instance of dct:PeriodOfTime with attributes: | Consider renaming to startValidity to be consistent. |
| | endValidity | End of the validity period for the dataset. When absent, no end is specified. | date | MD_DataIdentification.extent.EX_Extent.temporalElement.EX_TemporalExtent.extent Note: this is the same approach as used in DS-META-003 - Validity of the data set | dct:temporal.dct:endDate https://www.w3.org/TR/vocab-dcat-3/#Property:dataset_temporal | dct:temporal https://www.w3.org/TR/vocab-dcat-2/#Property:dataset_temporal Uses an instance of dct:PeriodOfTime with attributes: | |
| | publicationDate | Date at which the dataset became available. | date | MD_DataIdentification.citation.CI_Citation.date Note: this is the same approach as used in DS-META-002 - Date and time when provided. | dct:issued https://www.w3.org/TR/vocab-dcat-3/#Property:resource_release_date | dct:issued https://www.w3.org/TR/vocab-dcat-2/#Property:resource_release_date | |
| | datasetSeriesId | Identifier of the corresponding dataset series | IdentifierType | MD_DataIdentification.citation.CI_Citation.series | dct:isPartOf https://www.w3.org/TR/vocab-dcat-3/#Property:resource_qualified_relation | dct:isPartOf https://www.w3.org/TR/vocab-dcat-2/#Property:resource_qualified_relation | |
| | geographicalScope | The locations defining the scope of the dataset | Location | MD_DataIdentification.extent.EX_Extent.geographicalElement.EX_GeographicalExtent Note: this is the same approach as used in DS-META-007 - Geographical extent of the data set | dct:spatial https://www.w3.org/TR/vocab-dcat-3/#Property:dataset_spatial | dct:spatial https://www.w3.org/TR/vocab-dcat-2/#Property:dataset_spatial Uses an instance of dct:Location with Dublin Core, vCard, and AIXM attributes: (Note: vCard is one of the recommended DCAT vocabularies.) | Location: look into vCard. Location should be based on the AIRM/AIXM model. |
| | datasetFile | The dataset file that the metadata concerns | File | MD_Distribution | dcat:distribution https://www.w3.org/TR/vocab-dcat-3/#Property:dataset_distribution | dcat:distribution https://www.w3.org/TR/vocab-dcat-2/#Class:Distribution Uses an instance of a dcat:Distribution with attribute: Example values: | |
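As an illustration of how an implementer might apply the Dataset table above, the following is a minimal Python sketch that translates a record using the information definition's property names into a DCAT2-style JSON-LD dictionary. The function name `dataset_to_dcat2`, the lookup table and the input record are hypothetical; only properties that map directly onto a single DCAT2 term are covered, and the `dct:startDate`/`dct:endDate` keys follow the examples later on this page.

```python
import json

# Property name -> DCAT2 term, copied from the DCAT2 column of the Dataset table.
# Only properties that map one-to-one onto a single term are listed here.
DATASET_TO_DCAT2 = {
    "id": "dct:identifier",
    "title": "dct:title",
    "shortDescription": "dct:description",
    "type": "dct:type",
    "subject": "dcat:theme",
    "encoding": "dct:conformsTo",
    "publicationDate": "dct:issued",
    "datasetSeriesId": "dct:isPartOf",
}


def dataset_to_dcat2(record):
    """Translate a Dataset record into a DCAT2-style JSON-LD dictionary."""
    doc = {"@type": "dcat:Dataset"}
    for prop, term in DATASET_TO_DCAT2.items():
        if prop in record:
            doc[term] = record[prop]
    # effectiveDate and endValidity share one dct:PeriodOfTime instance.
    if "effectiveDate" in record:
        period = {
            "@type": "dct:PeriodOfTime",
            "dct:startDate": record["effectiveDate"],
        }
        if "endValidity" in record:  # when absent, no end is specified
            period["dct:endDate"] = record["endValidity"]
        doc["dct:temporal"] = [period]
    return doc


# Hypothetical input record; values borrowed from the Donlon examples.
record = {
    "id": "aerodrome-donlon-2020-may",
    "title": "Donlon Aerodrome Dataset - May 2020",
    "encoding": "AIXM 5.1.1",
    "effectiveDate": "2020-05-01",
    "endValidity": "2020-06-30",
    "publicationDate": "2020-04-01",
}
print(json.dumps(dataset_to_dcat2(record), indent=2))
```

Properties whose DCAT2 representation needs a nested instance (for example isAiracUpdate and limitationsOnUse) are left out of this sketch because their attribute names are still open in the table.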
DatasetSeries
| Type name | Property name | Definition | Type | ISO 19115:2003 | DCAT3 equivalent | DCAT2 equivalent | General notes |
|---|---|---|---|---|---|---|---|
| DatasetSeries | | Metadata about a dataset series. | | | dcat:DatasetSeries https://www.w3.org/TR/vocab-dcat-3/#dataset-series (exact details under discussion) | dcat:Catalog | |
| | id | Identification of the dataset series (e.g. a URI) | IdentifierType | MD_DataIdentification.citation.CI_Citation.identifier | @id | dct:identifier https://www.w3.org/TR/vocab-dcat-2/#Property:resource_identifier | |
| | title | Title of the dataset series | string | MD_DataIdentification.citation.CI_Citation.title | dct:title | dct:title | |
| | | | | | dct:description | | |
| | type | Enumeration indicating the main content of the dataset | CodeDatasetType | MD_DataIdentification.descriptiveKeywords.MD_Keywords.keyword | dct:type | | |
| | dataSubject | Data subject as defined in ICAO PANS-AIM | CodeDataSubjectType | MD_DataIdentification.descriptiveKeywords.MD_Keywords.keyword | dcat:theme | dcat:theme (assoc) | |
| | limitationsOnUse | Free text clarifying limitations with regard to the use of the datasets | string | MD_DataIdentification.resourceConstraints.MD_Constraints.useLimitations Can also use e.g. MD_LegalConstraints, MD_SecurityConstraints | | odrl:hasPolicy https://www.w3.org/TR/vocab-dcat-2/#Property:resource_access_rights https://www.w3.org/TR/vocab-dcat-2/#Property:resource_license https://www.w3.org/TR/vocab-dcat-2/#Property:resource_rights | Need to check what is meant here; policy is the most generic approach. |
| | encoding | Standard encoding of the data | CodeDataEncodingType | | | dct:conformsTo https://www.w3.org/TR/vocab-dcat-2/#Property:resource_conforms_to | |
| | geographicalScope | The locations defining the scope of the dataset series | Location | MD_DataIdentification.extent.EX_Extent.geographicalElement.EX_GeographicalExtent | dct:spatial | dct:spatial https://www.w3.org/TR/vocab-dcat-2/#Property:dataset_spatial | |
| | DPSId | Identifier of the corresponding data product specification | IdentifierType | | dct:isReferencedBy | dct:isReferencedBy https://www.w3.org/TR/vocab-dcat-2/#Property:resource_qualified_relation Note: Need to decide on the type of relationship | |
| | distribution(?) | | | | dct:dataset | dcat:dataset | |
DataProductSpecification
| Type name | Property name | Definition | Type | ISO 19115:2003 | DCAT3 equivalent | DCAT2 equivalent | General notes |
|---|---|---|---|---|---|---|---|
| DataProductSpecification | | Metadata about a Data Product Specification. | | | dcat:Resource | dcat:Resource | |
| | id | Identification of the data product specification | IdentifierType | MD_DataIdentification.citation.CI_Citation.identifier | @id | dct:identifier https://www.w3.org/TR/vocab-dcat-2/#Property:resource_identifier | |
| | title | Title of the data product specification | string | MD_DataIdentification.citation.CI_Citation.title | dct:title https://www.w3.org/TR/vocab-dcat-3/#Property:resource_title | dct:title | |
| | dpsFile | The file that the metadata concerns | File | | dcat:distribution | dcat:distribution https://www.w3.org/TR/vocab-dcat-2/#Class:Distribution Uses an instance of a dcat:Distribution with attribute: Example values: | |
| | | | | MD_DataIdentification.descriptiveKeywords.MD_Keywords.keyword | dct:type https://www.w3.org/TR/vocab-dcat-3/#Property:resource_type | dct:type | |
| | | | | | dct:references https://www.w3.org/TR/vocab-dcat-3/#Property:resource_relation https://dublincore.org/specifications/dublin-core/dcmi-terms/#references | dct:references | |
Data File
| Type name | Property name | Definition | Type | ISO 19115:2003 | DCAT3 equivalent | DCAT2 equivalent | General notes |
|---|---|---|---|---|---|---|---|
| File | | Metadata about a file. | | | dcat:Distribution | https://www.w3.org/TR/vocab-dcat-2/#Class:Distribution | |
| | id | Identification of the dataset file. The identification is unique within a service instance. | IdentifierType | | dcat:accessURL https://www.w3.org/TR/vocab-dcat-3/#Property:distribution_access_url | https://www.w3.org/TR/vocab-dcat-2/#Property:distribution_access_url | This is not the ID of the file, but the landing page, feed, SPARQL endpoint (etc) |
| | name | Name of the file | FileNameType | | dct:title https://www.w3.org/TR/vocab-dcat-3/#Property:distribution_title | https://www.w3.org/TR/vocab-dcat-2/#Property:distribution_title | |
| | format | Format of the file | CodeFileFormatType | | dct:format https://www.w3.org/TR/vocab-dcat-3/#Property:distribution_format | https://www.w3.org/TR/vocab-dcat-2/#Property:distribution_format | |
| | size | Size of the file in bytes. | Number | | dcat:byteSize https://www.w3.org/TR/vocab-dcat-3/#Property:distribution_size | https://www.w3.org/TR/vocab-dcat-2/#Property:distribution_size | |
| | | | | | dcat:downloadURL | | |
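The File mapping above could be applied along the following lines. This is a sketch only: the function `file_to_distribution` and its `download_url` parameter are hypothetical names introduced for illustration. The `id` property is deliberately not mapped, because the notes column points out that dcat:accessURL is not the ID of the file.

```python
from typing import Optional

# Property name -> DCAT term, taken from the Data File table.
FILE_TO_DCAT = {
    "name": "dct:title",
    "format": "dct:format",
    "size": "dcat:byteSize",
}


def file_to_distribution(file_record: dict, download_url: Optional[str] = None) -> dict:
    """Build a dcat:Distribution dictionary from a File record."""
    dist = {"@type": "dcat:Distribution"}
    for prop, term in FILE_TO_DCAT.items():
        if prop in file_record:
            dist[term] = file_record[prop]
    # The download location is passed separately; the File type has no such property.
    if download_url is not None:
        dist["dcat:downloadURL"] = download_url
    return dist


# Hypothetical File record; the URL is taken from the Donlon examples.
distribution = file_to_distribution(
    {"id": "file-1", "name": "data.xml", "size": 34254},
    download_url="http://donlon.aero/dataset/aerodrome/2020/May/data.xml",
)
print(distribution)
```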
Examples (using DCAT)
Describing the Dataset Series
{ "@id": "http://donlon.aero/dataset/aerodrome.json", "@type": "dcat:Catalog", "dct:identifier": "aerodrome-series-2020", "dct:title": "Donlon Aerodrome Dataset Series 2020", "dct:type": "Aerodrome Dataset", "dcat:theme": "http://icao.int/pans/aim/subject/aerodrome", "dct:conformsTo": "AIXM 5.1.1", "dcat:dataset": ["http://donlon.aero/dataset/aerodrome/2020/Jan.json", "http://donlon.aero/dataset/aerodrome/2020/Mar.json", "http://donlon.aero/dataset/aerodrome/2020/May.json"], "dct:isReferencedBy": "http://donlon.aero/dps/aerodrome.json", "dct:spatial": "http://donlon.aero/location/donlon.json" }
{ "@id": "http://donlon.aero/series/aerodrome/", "@type": "dcat:DatasetSeries", "dct:title": "Donlon Aerodrome Dataset Series", "dct:description": "Aerodrome datasets for Donlon ...", "dcat:theme": "http://vocab.icao.aero/feature/aerodrome", "dct:spatial": { "@type": "http://vocab.icao.aero/feature/aerodrome", "aixm:name": "Donlon International Airport", "aixm:designator": "DONDON", "aixm:designatorIATA": "ZZZZ" }, "dct:dataset": ["http://donlon.aero/series/aerodrome/2021/Jan", "http://donlon.aero/series/aerodrome/2021/Feb", "http://donlon.aero/series/aerodrome/2021/Mar"], "dct:isPartOf": "http://donlon.aero/data-catalogue", "dct:isReferencedBy": "http://donlon.aero/product/aerodrome" }
Describing the Location
{ "@id": "http://donlon.aero/location/donlon.json", "@type": "Airport", "dct:spatial": [ { "aixm:name": "Donlon International Airport", "aixm:designator": "DONDON", "aixm:designatorIATA": "ZZZZ" }] }
Describing the Dataset
{ "@id": "http://donlon.aero/dataset/aerodrome/2020/May.json", "@type": "dcat:Dataset", "dct:identifier": "aerodrome-donlon-2020-may", "dct:title": "Donlon Aerodrome Dataset - May 2020", "dct:type": "Aerodrome Dataset", "dcat:theme": "http://icao.int/pans/aim/subject/aerodrome", "dct:format": "Baseline", "prov:wasGeneratedBy": [ { "@type": "prov:Activity", "prov:used": "AIRAC" }], "odrl:hasPolicy": [ { "@type": "odrl:Prohibition", "odrl:action": "operational-use" }], "dct:conformsTo": "AIXM 5.1.1", "dct:temporal": [ { "@type": "dct:PeriodOfTime", "dct:startDate": "2020-05-01", "dct:endDate": "2020-06-30" }], "dct:issued": "2020-04-01", "dct:spatial": "http://donlon.aero/location/donlon.json", "dct:isPartOf": "http://donlon.aero/dataset/aerodrome.json", "dcat:distribution": [ { "@type": "dcat:Distribution", "dcat:downloadURL": "http://donlon.aero/dataset/aerodrome/2020/May/data.xml" }] }
{ "@id": "http://donlon.aero/series/aerodrome/2021/Jan", "@type": "dcat:Dataset", "dct:title": "Donlon Aerodrome Dataset - January 2021", "dct:type": "http://vocab.icao.aero/info/baseline", "dcat:theme": "http://vocab.icao.aero/feature/aerodrome", "dct:accrualMethod": "http://vocab.icao.aero/info/airac", "dct:conformsTo": "http://vocab.icao.aero/info/aixm/511", "odrl:hasPolicy": { "@type": "odrl:Prohibition", "odrl:action": "http://vocab.icao.aero/policy/operational-use" }, "dct:issued": "2020-12-31", "dct:temporal": { "@type": "dct:PeriodOfTime", "dct:startDate": "2021-01-01", "dct:endDate": "2021-01-30" }, "dcat:distribution": { "@type": "dcat:Distribution", "dcat:downloadURL": "http://donlon.aero/series/aerodrome/2021/Jan/data.xml", "dcat:mediaType": "https://www.iana.org/assignments/media-types/application/gml+xml", "dcat:byteSize": "34254" }, "dcat:service": { "@type": "dcat:DataService", "dcat:endpointURL": "http://donlon.aero/api/aerodrome/", "dcat:compressFormat": "https://www.iana.org/assignments/media-types/application/gzip" }, "dct:isPartOf": "http://donlon.aero/series/aerodrome/" }
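The two Dataset examples above encode dct:temporal differently: one as a list of dct:PeriodOfTime instances, the other as a single object. A consumer may therefore want to normalise both shapes before using the validity period. The following Python sketch shows one way to do this; the function name `validity_periods` is hypothetical, and the `dct:startDate`/`dct:endDate` keys are assumed to be ISO 8601 dates as in the examples.

```python
from datetime import date


def validity_periods(dataset: dict) -> list:
    """Return (start, end) date pairs from a dataset's dct:temporal value.

    Accepts either a single dct:PeriodOfTime object or a list of them.
    `end` is None when no dct:endDate is given (no end specified).
    """
    temporal = dataset.get("dct:temporal", [])
    if isinstance(temporal, dict):
        temporal = [temporal]
    periods = []
    for period in temporal:
        start = date.fromisoformat(period["dct:startDate"])
        end = None
        if "dct:endDate" in period:
            end = date.fromisoformat(period["dct:endDate"])
            if end < start:
                raise ValueError("validity period ends before it starts")
        periods.append((start, end))
    return periods


# Both encodings used in the examples yield the same structure.
as_list = {"dct:temporal": [{"dct:startDate": "2020-05-01", "dct:endDate": "2020-06-30"}]}
as_object = {"dct:temporal": {"dct:startDate": "2021-01-01", "dct:endDate": "2021-01-30"}}
print(validity_periods(as_list))
print(validity_periods(as_object))
```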
Describing the Data Product Specification
{ "@id": "http://donlon.aero/dps/aerodrome.json", "@type": "dcat:Resource", "dct:identifier": "dps-aerodrome-series-2020", "dct:title": "Data Product Specification: Donlon Aerodrome Dataset Series 2020", "dct:type": "DPS", "dct:references": "http://donlon.aero/dataset/aerodrome.json", "dcat:distribution": [ { "@type": "dcat:Distribution", "dcat:downloadURL": "http://donlon.aero/dps/aerodrome-2020-May.pdf" }] }
{ "@id": "http://donlon.aero/product/aerodrome", "@type": "dcat:Resource", "dct:title": "Data Product Specification: Donlon Aerodrome Dataset Series ", "dct:type": "http://vocab.icao.aero/info/data-product-spec", "dct:references": "http://donlon.aero/series/aerodrome/", "dcat:distribution": { "@type": "dcat:Distribution", "dcat:downloadURL": "http://donlon.aero/dps/aerodrome-dps.pdf", "dcat:mediaType": "https://www.iana.org/assignments/media-types/application/pdf" } }
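The examples link the dataset series, its datasets and the data product specification to each other through dct:isPartOf, dct:isReferencedBy and dct:references. A catalogue implementer might want to check that these links agree. The sketch below is one possible check in Python; the function name `check_links` is hypothetical, and the input dictionaries are trimmed copies of the examples above.

```python
def check_links(series: dict, datasets: list, dps: dict) -> list:
    """Return a list of human-readable problems; empty when all links agree."""
    problems = []
    series_id = series["@id"]
    # Each dataset should point back at the series it belongs to.
    for dataset in datasets:
        if dataset.get("dct:isPartOf") != series_id:
            problems.append(f"{dataset['@id']}: dct:isPartOf does not point at {series_id}")
    # The DPS references the series, and the series is referenced by the DPS.
    if dps.get("dct:references") != series_id:
        problems.append(f"{dps['@id']}: dct:references does not point at {series_id}")
    if series.get("dct:isReferencedBy") != dps["@id"]:
        problems.append(f"{series_id}: dct:isReferencedBy does not point at {dps['@id']}")
    return problems


# Trimmed copies of the examples on this page.
series = {
    "@id": "http://donlon.aero/series/aerodrome/",
    "dct:isReferencedBy": "http://donlon.aero/product/aerodrome",
}
dataset = {
    "@id": "http://donlon.aero/series/aerodrome/2021/Jan",
    "dct:isPartOf": "http://donlon.aero/series/aerodrome/",
}
dps = {
    "@id": "http://donlon.aero/product/aerodrome",
    "dct:references": "http://donlon.aero/series/aerodrome/",
}
print(check_links(series, [dataset], dps))
```

The check compares identifiers as plain strings, so a trailing slash or a different scheme counts as a mismatch.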