Closed epoz closed 1 month ago
Just to make sure that I understand your question correctly:
Are you wondering about the strange predicate @de@<http:....>
which is (wrongly) marked as a literal?
Those predicates and their triples are artifacts of QLever's implementation of language filters. They are currently leaked for queries without a fixed predicate in all triples (your predicate is a variable ?p
).
It is on our list to fully hide these triples (wrt to exports/query results/ statistics) such that this can't happen.
Thanks for reporting this, we now have an issue to track. It is also interesting to see, how these triples currently interact with the JSON exporter. Please confirm that I have correctly identified your issue.
Thanks for the answer, yes I was wondering about the predicates being marked as literals in the SPARQL query results. First time I saw something like this was in a CMS vendor publishing their Linked Data one year ago. Filed a bug report there, but never heard back from them. Now we know that they are using QLever as a triplestore "under the covers" ;-)
I am new to QLever, just ran some tests ingesting data last week and noticed the strange output. At first I thought it was something wrong in my own data, but then noticed it was also in the sample datasets provided. (and I recalled seeing this previously elsewhere, as mentioned)
Look forward to learning more about the internals later, good to know that this is an implementation artefact and will eventually disappear.
@epoz Interesting story, which CMS vendor was this? Is there a issue on their GitHub or was this some internal bug tracker?
It is a Dutch CMS for cultural heritage content: https://kleksi.com/nl/home Their service is not open source, and does not have a bug tracker. I submitted a request to their support@ mail, and received a reply via email later from the owner that they would pass my query on to their tech department. Never heard back from them, and I did not consider it important enough to follow up.
This was the event where we were spelunking for interesting data sources.
When indexing data with QLever, sometimes strange values are returned for the predicates. To reproduce:
This produces values that look like:
Notice the weird literal shown as a predicate, as if there was a linebreak issue in parsing the source data. Here we are showing one of the example datasets provided by the authors, the DNB set.
But this has also happened in some cases for triples that I have created myself, where I could verify that the input triples did not seem malformed.