Dataset Register
The Dataset Register is the list of datasets in the Dutch heritage network. For each dataset, it provides a machine-readable description that includes:
- the dataset name, publisher
- information on how to access the dataset’s content, for example data dumps and SPARQL endpoints
Components
- Users can search the Dataset Register website for datasets.
- Software developers that want to find datasets may want to use the Dataset Register SPARQL endpoint.
- Software developers that want to register dataset descriptions should use the Dataset Register REST API.
Registration flow
To make a dataset description visible on the Dataset Register website, Data Platforms follow these steps (see also the Requirements for Datasets):
- A Collection Manager produces a dataset description and publishes it on the web (e.g. on a website or in a SPARQL endpoint).
- The URL to the dataset description is registered with the Dataset Register.
- The Dataset Register periodically fetches all dataset descriptions, validates them, and stores them for later retrieval.
- The Dataset Knowledge Graph periodically fetches valid descriptions from the Dataset Register, analyses linked datasets, and stores their summaries.
- When users consult the Dataset Register website, information from the Dataset Register and the Knowledge Graph is combined.