KorAP / Krill

:mag: A Corpus Data Retrieval Index using Lucene for Look-Ups
BSD 2-Clause "Simplified" License
16 stars 3 forks source link

Missing matches #42

Closed margaretha closed 6 years ago

margaretha commented 6 years ago

Using https://github.com/KorAP/Kustvakt/tree/master/sample-index, Krill does not return any matches for:

{
    "query": {
        "@type": "koral:token",
        "wrap": {
            "@type": "koral:term",
            "match": "match:eq",
            "key": "der",
            "layer": "orth",
            "foundry": "opennlp"
        }
    },
    "collection": {
        "@type": "koral:docGroup",
        "operation": "operation:and",
        "operands": [
            {
                "@type": "koral:doc",
                "match": "match:eq",
                "type": "type:regex",
                "value": "CC-BY.*",
                "key": "availability"
            },
            {
                "@type": "koral:doc",
                "match": "match:ne",
                "value": "GOE/AGI/00000",
                "key": "textSigle"
            }
        ]
    }
}

This happens because GOE/AGI/00000 is not part of the index. Replacing it with GOE/AGI/04846 will return some matches.

margaretha commented 6 years ago

See de.ids_mannheim.korap.collection.TestWithExternalIndex.testIndexTextSigleNe() in ecddb0b23d63605160c19b22353545727ec80957.