gkunter / coquery

Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpus.
GNU General Public License v3.0
18 stars 4 forks source link

Query lemmatization doesn't seem to work as expected #296

Open gkunter opened 6 years ago

gkunter commented 6 years ago

Test case: Corpus CELEX, query string #?ame, selected features: Word. Head

The expected behavior is to return all orthographic words that are assigned to a matching lemma. Matching lemmas are defined as any lemma that has a word-form matching ?ame. Thus, the lemma come is a matching word-form, and the query should return the word-forms come, came, coming, and comes, alongside the word forms of e.g. the lemma game (game, games, gamed, gaming).

However, only the word-forms that match ?ame are returned (e.g. came and game).