Open aindlq opened 1 month ago
I get this error also:
PREFIX crm: <http://www.cidoc-crm.org/cidoc-crm/>
PREFIX lux: <https://lux.collections.yale.edu/ns/>
SELECT DISTINCT ?what ?txt ?atxt WHERE {
?what a crm:E22_Human-Made_Object ; lux:primaryName ?txt ; lux:agentOfProduction ?artist .
?artist lux:primaryName ?atxt .
?tt ql:contains-entity ?atxt ; ql:contains-word "van gogh" .
?t ql:contains-entity ?txt ; ql:contains-word "nuit" .
}
Error: Assertion
nofBytes > 0failed. Please report this to the developers. In file "/app/src/index/IndexImpl.Text.cpp " at line 848
But if I change van
to vincent
it works as expected.
Update: For me, any token with three or fewer characters causes the exception. so "de nuit" is bad but "night nuit" is fine.
Thanks for reporting all the issues with the Text Index.
It is one of the features that hasn't been under really active development in the last years. It is good that you show interest in this feature so we can prioritize it. I have a plan for a complete rewrite of the text index which should mitigate most of the current limitations, but I think the support for Named Graphs (which people are also asking for) has some priority.
I've been trying to better understand the way how current text index works using very tiny test data, but all queries fail when data is too simple.
query:
error message:
test.wordsfile.tsv:
test.docsfile.tsv:
test.nt: