Using the portal
All data is available for free and can be used, for example, for business creation. For more details, see legal notice.
Datasets can be exported to WMS, WFS, KML, HTML, Excel, PDF, XML, JSON, RSS, GML, SVG, SHP, PNG, JPEG, GIF, RDF-XML, RDF-Turtle, RDF-N3, OCTET STREAM, JSON-LD, and Atom.
The portal collects all datasets from the portals it harvests, without excluding any formats. Data are collected in the file format provided by the source.
Data.europa.eu collects all datasets from the portals it harvests, without excluding datasets under non-commercial licences. Data is collected with the type of licence provided by the source.
A licence is an explicit and legally binding statement of recipients’ rights, restrictions and obligations in relation to a specific dataset. Usually, it is expressed through a written contract or through a unilateral statement from the rights holder(s), but it may also be expressed through legislation or other regulatory initiatives.
The datasets stored in the portal need to be of an appropriate quality in terms of:
- DCAT-AP-compliant mapping
- Available distributions
- Usage of machine-readable distribution formats
- Usage of known open-source licences.
The MQA presents its results in two views.
- The landing page or ‘Global Dashboard’. This view shows aggregated results for the entire service, i.e. the quality details for all catalogues.
- The second view or ‘Catalogue Dashboard’. This view allows you to select a specific catalogue for which you want to display the quality details.
The current quality indicators include the following.
- accessible distributions
- error status codes
- download URL
- top 20 catalogues with most accessible distributions,
- ratio of machine-readable datasets,
- most-used distribution formats,
- top 20 catalogues mostly using common machine-readable datasets.
- Dataset compliance statistics:
- top violation occurrences,
- compliant datasets,
- top 20 catalogues with most DCAT-AP-compliant datasets.
- Dataset licence usage:
- ratio of known to unknown licences,
- most used licences,
- top 20 catalogues with most datasets with known licences.
The visualisation tool uses the files as provided by the source. It is possible that the tool does not accept the provided file format or that the files are corrupted at the source. The portal has no influence on the source files.
The map search enables users to find datasets containing geo information from a specific region. You must type in the region or draw a bounding box on the map.
You find all information on this page.
API access URLs:
- Search: https://data.europa.eu/api/hub/search/ (Note: Only 'Read-Only' actions are currently supported for this API)
- SPARQL: https://data.europa.eu/sparql
- Registry: https://data.europa.eu/api/hub/repo/
- Use Cases: https://data.europa.eu/en/export-use-cases
API Documentation is available for the following systems:
- Search: https://data.europa.eu/api/hub/search/
- SPARQL: https://www.w3.org/TR/rdf-sparql-query/
- Download MQA reports: https://data.europa.eu/api/mqa/reporter/index.html
- How metadata is used checked by MQA: https://data.europa.eu/mqa/methodology
- MQA API: https://data.europa.eu/api/mqa/cache/index.html
- SHACL metadata validation: https://data.europa.eu/api/mqa/shacl/index.html
- SHACL metadata validation UI: https://data.europa.eu/mqa/shacl-validator-ui/
- Read access to triple store data content: https://data.europa.eu/api/hub/repo/index.html
Integration on any external application with the portal can only happen at the dataset level by using the existing CKAN-API, via which you may "extract/query" datasets.
E.g., the API calls "https://data.europa.eu/api/hub/search/#tag/Ckan" and returns the list of datasets categorised in JSON format.
You can also use the SPARQL-Manager and run customised SPARQL queries against the Virtuoso RDF triple store that is synchronised with the CKAN repository.