Dataset Knowledge Graph
The Dataset Knowledge Graph enriches the Dataset Register with insights derived from the actual content of each dataset. A pipeline periodically fetches valid dataset descriptions, analyses the RDF distributions they point to, and produces Dataset Summaries – modelled as VoID – that describe the empirical shape of each dataset:
- the counts of RDF triples, subjects, predicates, and objects (split into literals and URIs);
- the classes that occur and how many instances each has;
- the predicates that occur and how they are distributed across subject classes;
- the external terminology sources and vocabularies the dataset links to.
These summaries help researchers, software developers, and data platform builders assess whether a dataset is suitable for their purpose and which query patterns they can use to access it.
Access
- Explore aggregated insights across all datasets in the Dataset Knowledge Graph datastory.
- Query the graph directly via the SPARQL endpoint at
https://triplestore.netwerkdigitaalerfgoed.nl/repositories/dataset-knowledge-graph. - The source code of the pipeline is available on GitHub.