CLARIAH / clariah-plus

This is the project planning repository for the CLARIAH-PLUS project. It groups all technical documents and discussions pertaining to CLARIAH-PLUS in a central place and should facilitate findability, transparency and project planning, for the project as a whole.
9 stars 6 forks source link

Writeup Relation Registry ideas/plans #78

Open menzowindhouwer opened 2 years ago

menzowindhouwer commented 2 years ago

RR is an older idea (1, 2, 3, 4) that needs a fresh and actualized writeup

proycon commented 2 years ago

Probably as part of the general FAIR Vocabularies plan in the SDR v2?

menzowindhouwer commented 2 years ago

A first sketch: https://docs.google.com/document/d/19gDO8pdWiPtOrClcrsZb5qV02oqc5UDr31TAMWgIefk/edit?usp=sharing

menzowindhouwer commented 2 years ago

RELcat paper: Depending on the future size of ISOcat and RELcat the use of (almost) same as relationships might require special handling. These (loose) equivalence relationships might possibly lead to a combinatorial explosion. Some commercial triple stores already offer specific support to handle these kinds of large graphs:

  1. Oracle Database Semantic Technologies (Oracle, 2005) support so called owl:sameAs cliques which can be consolidated by choosing a clique representative, and
  2. OWLIM (ontotext, 2011) supports a similar approach using a so called master node.

In RELcat the same kind of approach could be followed: 1) either select one representative from the same-as clique or 2) give the same-as clique its own identifier. A possible complication there is the dynamic combination of relationship sets, which might require that these cliques need to be computed on the fly.

The references for Oracle and OWLIM needed to be refreshed: 1) Oracle: https://docs.oracle.com/cd/E11882_01/appdev.112/e25609.pdf section 2.2.8 2) OWLIM, now GraphDB: https://graphdb.ontotext.com/documentation/standard/sameas-optimisation.html