gkunter / coquery

Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpus.
GNU General Public License v3.0

Value substitution doesn't work with subcorpus size #214

Closed: gkunter closed this issue 7 years ago

gkunter commented 7 years ago

Test case: ICE_NG, query string [boy|girl], output columns Query string and Gender. With no substituted values for the Gender column, the G-test matrix works. If a value is substituted, for example m by male, it fails when attempting to retrieve the log-likelihoods.

If value substitution is used for corpus features, the subcorpus functions stop working, because the substituted value rather than the original corpus value is used to determine, for example, the subcorpus size.

Possible solution: get_subcorpus_size() could be made aware of substitutions and reverse them before determining the subcorpus size.
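
A minimal sketch of that idea, not taken from the Coquery code base: the signature of get_subcorpus_size(), the helper reverse_substitution(), the shape of the substitutions mapping (column name to {original: displayed}), and the sizes are all assumptions made up for illustration.

```python
# Hypothetical sketch; names, signatures, and data shapes are assumptions.

def reverse_substitution(value, mapping):
    """Map a displayed value back to the original corpus value.

    `mapping` is {original: displayed} for one column. Values without a
    substitution are returned unchanged.
    """
    reverse = {}
    for original, displayed in mapping.items():
        if displayed in reverse:
            # Two originals share one displayed value: cannot be undone.
            raise ValueError("ambiguous substitution: %r" % (displayed,))
        reverse[displayed] = original
    return reverse.get(value, value)


def get_subcorpus_size(row, sizes, substitutions):
    """Look up the subcorpus size using the original feature values.

    `row` maps column names to displayed values, `sizes` maps tuples of
    original values (columns in sorted order) to token counts, and
    `substitutions` maps column names to their {original: displayed} dict.
    """
    key = tuple(
        reverse_substitution(row[column], substitutions.get(column, {}))
        for column in sorted(row)
    )
    return sizes[key]


# Made-up sizes, keyed by the original Gender values.
sizes = {("f",): 800, ("m",): 1200}
substitutions = {"Gender": {"m": "male"}}

print(get_subcorpus_size({"Gender": "male"}, sizes, substitutions))  # 1200
print(get_subcorpus_size({"Gender": "f"}, sizes, substitutions))     # 800
```

If several original values are substituted by the same displayed value, the reversal is ambiguous. The sketch raises an error in that case; summing the sizes of all matching originals might be the more useful behaviour.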