Incorporate corpus/lemma and dictionary/morpheme frequencies

UAlbertaALTLab / crk-db

Managing the Plains Cree dictionary database

GNU General Public License v3.0

0 stars 3 forks source link

To replace the current file: ~/giella/art/dicts/crk/Wolvengrey/W_aggr_corp_morph_log_freq.txt, with the process described here:

https://github.com/UAlbertaALTLab/cree-intelligent-dictionary/issues/163

... we'd want to implement the incorporation of comparable information with our aggregate dictionary database.

Based on the materials we have for Cree, I'd presume one or more corpus-based frequencies (not only Ahenakew-Wolfart but also Bloomfield), as well as a dictionary/morpheme-based ranking, which might be corpus-weighted as well. So these would seem features to be added to the aggregate dictionary entries.

UAlbertaALTLab / crk-db

Incorporate corpus/lemma and dictionary/morpheme frequencies #51