KorAP / Krill

:mag: A Corpus Data Retrieval Index using Lucene for Look-Ups
BSD 2-Clause "Simplified" License
16 stars 3 forks source link

Position queries are broken #10

Closed Akron closed 8 years ago

Akron commented 8 years ago

@jbingel found a bug (both Poliqarp):

overlaps(<s>, [orth=Mann])

and

overlaps([orth=Mann], <s>)

return different results.

A similar issue arises with the following queries:

[cnx/syn=@NH & corenlp/ne_dewac_175m_600=I-ORG]
contains([cnx/syn=@NH], [corenlp/ne_dewac_175m_600=I-ORG])

I guess the problem is too optimistic forwarding. Should be fixable with the switch to bitvector comparisons.

And sometimes matches are not shown - e.g. in the following query in the current version of wikipedia one of three matches is skipped:

http://10.0.10.14:6666/?q=contains%28%3Cs%3E%2C+ich%29&ql=poliqarp&cutoff=1 (Internal instance)

This bug was transfered from Trac Issue #127

Akron commented 8 years ago

Fixed in 0.55.4 (as part of the WithinSpans lazy loading bug).