netwerk-digitaal-erfgoed / dataset-knowledge-graph

Pipeline that generates the NDE Dataset Knowledge Graph
European Union Public License 1.2
3 stars 0 forks source link

License(s) per datasets seems incorrect #54

Closed coret closed 11 months ago

coret commented 11 months ago

Issue https://github.com/netwerk-digitaal-erfgoed/dataset-knowledge-graph/issues/25 brought the analysis of used licenses. When I look at the licenses per dataset via query on the dataset-knowledge-graph, I see several datasets have multiple licenses, which incorrect, for example:

https://studiezaal.nijmegen.nl/AtlantisPubliek/data/dataset/LOD+Beelddocumenten

And I wonder about https://studiezaal.nijmegen.nl/AtlantisPubliek/data/dataset/Catalog. Although a Catalog does have a license (http://creativecommons.org/publicdomain/zero/1.0/ in this example), I wonder if we should count this as a Dataset?

ddeboer commented 11 months ago

@coret As we discussed today (and the readme says), these are:

Licenses that apply to resources in the dataset.

So not to the dataset as whole, but to individual objects within the dataset. Do the results make more sense that way? If not, please let me know!

coret commented 11 months ago

Yes, I've checked the Gouda Timemachine KG and there appear multiple CC licenses (occuring within the datasetdescriptions which are part of this KG).

I do see that the query mentioned does produce significantly less results today than 3 days ago (Gouda Timemachine isn't part of the resultset, nor is Nijmegen).