sailfish-keyboard / presage

Fork of Presage (http://presage.sourceforge.net/)
GNU General Public License v2.0
6 stars 10 forks source link

simplify tokenizer on import and request cleaned up training text file #14

Closed rinigus closed 6 years ago

rinigus commented 6 years ago

related to https://github.com/rinigus/presage/issues/13 and missing words with ' in English

rinigus commented 6 years ago

Import can be done using external tools , such as NLTK. Problems with tokenization should be taken for each language separately