netwerk-digitaal-erfgoed / requirements-datasets

Requirements for datasets
https://netwerk-digitaal-erfgoed.github.io/requirements-datasets/

How to define a subset of a source dataset #60

Open coret opened 2 years ago

coret commented 2 years ago

When an aggregator (such as NOB) publishes a dataset that originates from another organisation and is a subset of that organisation's data, for example only the WWII-related images from its entire image collection, how should this "subset" be described in the dataset description?

The property schema:isBasedOn links to the source dataset (provenance), but this will probably be a link to the whole collection, because the organisation has described only its entire image collection as a dataset, not the subset.

The fact that the aggregator used only a selection (and may have transformed or enriched this subset) is of interest to users of the dataset. Should it just go in schema:description?
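
To make the question concrete, here is a minimal sketch of the dataset description as it could look with only the two properties mentioned so far. It builds a schema.org JSON-LD object in Python. All URIs, the dataset name, and the description text are hypothetical placeholders, not real NOB or source-organisation identifiers.

```python
import json

# Hypothetical URIs for illustration only.
SOURCE_DATASET = "https://example.org/datasets/image-collection"
SUBSET_DATASET = "https://example.org/datasets/image-collection-wwii"


def describe_subset(subset_uri, source_uri, name, selection_note):
    """Build a schema.org Dataset description for a derived subset."""
    return {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "@id": subset_uri,
        "name": name,
        # Provenance: points at the complete source dataset, not at a subset.
        "isBasedOn": {"@id": source_uri},
        # Free text: the only place where the selection is recorded here.
        "description": selection_note,
    }


description = describe_subset(
    SUBSET_DATASET,
    SOURCE_DATASET,
    "WWII-related images (hypothetical example)",
    "Selection of WWII-related images from the source image collection; "
    "the selection may have been transformed or enriched.",
)

print(json.dumps(description, indent=2))
```

In this sketch the selection criteria exist only as free text in the description, so a machine cannot tell that the dataset is a subset, which is the gap the question is about.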

bencomp commented 2 years ago

The first thing that comes to mind is <LargeDataset> void:subset <SubsetOfLargeDataset> ., though that property links two void:Datasets.
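
As a rough sketch of that suggestion, the snippet below writes the statement out as a single N-Triples line. The void: prefix is expanded to the VoID namespace http://rdfs.org/ns/void#. The two dataset URIs are hypothetical placeholders, and the helper name void_subset_triple is made up for this example.

```python
# Full IRI of the VoID property; the void: prefix stands for http://rdfs.org/ns/void#
VOID_SUBSET = "http://rdfs.org/ns/void#subset"


def void_subset_triple(whole_uri, part_uri):
    """Return one N-Triples line stating that part_uri is a subset of whole_uri."""
    # The subject is the larger dataset; the object is its subset.
    return f"<{whole_uri}> <{VOID_SUBSET}> <{part_uri}> ."


# Hypothetical URIs for illustration only.
triple = void_subset_triple(
    "https://example.org/datasets/image-collection",
    "https://example.org/datasets/image-collection-wwii",
)

print(triple)
```

Note the direction: the statement is made about the larger dataset, so it would naturally live in the source organisation's description unless the aggregator asserts it itself. The caveat above still applies, since void:subset is defined between void:Datasets.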