Simon-Dirks / view-a-LOD

A flexible and configurable LOD viewer using SPARQL and Elastic endpoints.
GNU General Public License v3.0
0 stars 2 forks source link

Fix RAZU "is onderdeel van" issues #149

Open Simon-Dirks opened 1 month ago

Simon-Dirks commented 1 month ago

Now: Showing "Gerechtsbestuur Amerongen, Elst en Ginkel" many times.

Image

To preview: prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> prefix mdto: <http://www.nationaalarchief.nl/mdto#> select * where { ?s <http://www.nationaalarchief.nl/mdto#isOnderdeelVan> ?o. ?o mdto:begripLabel ?oLabel . } limit 1000 on https://data.razu.nl/razu/PoCAmerongen2024/sparql.

Wietse mentioned on June 19th that the SPARQL should instead look more like this: prefix rdfs: http://www.w3.org/2000/01/rdf-schema# prefix mdto: http://www.nationaalarchief.nl/mdto# select * where { ?s http://www.nationaalarchief.nl/mdto#isOnderdeelVan/http://www.nationaalarchief.nl/mdto#identificatie/http://www.nationaalarchief.nl/mdto#identificatieKenmerk ?o. ?o mdto:naam ?oLabel . } limit 1000

Unfortunately we can not do any hops with elastic, it uses a completely "flat" index so only knows of "isOnderdeelVan" at the root level of the node and nothing more. We ran into similar issues with the The Utrecht Archives dataset earlier, but have no support (yet) for tweaking elastic indexing settings at Triply.

Even if I use a custom SPARQL query for rendering the "hierarchy" view for RAZU, the filter options would still be off as those are all handled by elastic.

Afraid we have to wait for more control over the elastic indexing process in order to resolve these issues. (either by self-hosting the triple store + elastic, or by Triply giving us more control over this)