KorAP / Krill

:mag: A Corpus Data Retrieval Index using Lucene for Look-Ups
BSD 2-Clause "Simplified" License
16 stars 3 forks source link

Fallback for element contexts #27

Open Akron opened 7 years ago

Akron commented 7 years ago

Currently, Krill refers and relies to some annotations from the base, namely s to set boundaries for annotation retrieval and p for snippet retrieval (in case, this is wanted). But - this fails in case a match is not in a sentence or a boundary, which can happen with the new data from Wikipedia.

For match retrieval there should exist a fallback mechanism to use token contexts whenever sentence- or paragraph-contexts fail.