netwerk-digitaal-erfgoed / dataset-knowledge-graph

Pipeline that generates the NDE Dataset Knowledge Graph
European Union Public License 1.2
2 stars 0 forks source link

Coordinate with CLARIAH FAIR Vocabularies summarizer #9

Open ddeboer opened 2 years ago

ddeboer commented 2 years ago

In CLARIAH+, a dataset description summarizer will be developed: https://github.com/CLARIAH/clariah-plus/issues/73 and https://github.com/CLARIAH/clariah-plus/issues/71. Is there overlap with the NDE Knowledge Graph Pipeline? Can we share development?

Their source code is now available at https://github.com/CLARIAH/did-summarizer.

ddeboer commented 1 year ago

@4tikhonov Some first observations:

  1. The NDE pipeline outputs proper RDF (VoID), while CLARIAH’s pipeline does not. RDF is preferred because it’s easier to interpret by users.
  2. NDE uses SPARQL queries where we can while the CLARIAH pipeline runs through the RDF triple by triple. How does that scale? Does it stream?
  3. The NDE pipeline is modular and will be configurable while the CLARIAH pipeline seems to be less so.
ddeboer commented 1 year ago

See https://docs.google.com/spreadsheets/d/1hY2zPmFDZ6aIlY-MupK6gvynQZucOxikFDgv5YWxFyQ for a CLARIN/NDE/CLARIAH Fair Implementation Profile (FIP).