Open moskaliukua opened 2 months ago
Hi @moskaliukua,
Thanks for highlighting this issue.
The lexicon was trained on a corpus containing archaic words like "Ain't". This gets tokenised as two tokens, 'Ai' and 'not', where 'Ai' is an auxiliary verb.
We plan to rebuild it soon with the corrections incorporated.
Shall keep you posted.
Best, Rachna
Hi, I have run into a problem with POS tagging in sentences like "It is an AI". It seems to be consistent in other sentences as well, for example:
"it made a lot of waves in the AI field." I would expect the word "AI" to be classified as PROPN, but instead I get AUX, and the lemma is "be".
Package versions: "wink-eng-lite-web-model": "^1.8.0", "wink-nlp": "^2.3.0"
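Until a rebuilt lexicon is available, one possible stopgap is to post-process the tagger output. The snippet below is only a minimal sketch under stated assumptions: `retagUppercaseAI` is a hypothetical helper, not part of wink-nlp, and it assumes the tokens have already been collected into plain objects with `text`, `pos` and `lemma` fields (for instance from the `its.pos` and `its.lemma` outputs). It only touches the exact uppercase form "AI", so contractions like "ain't" should be left alone.

```javascript
// Hypothetical helper (not part of wink-nlp): retag the uppercase acronym "AI"
// when the tagger has labelled it as an auxiliary verb.
function retagUppercaseAI(tokens) {
  return tokens.map((token) => {
    const isMistaggedAcronym = token.text === 'AI' && token.pos === 'AUX';
    // Always return a copy so the caller's array is left unchanged.
    return isMistaggedAcronym
      ? { ...token, pos: 'PROPN', lemma: 'AI' }
      : { ...token };
  });
}

// Sample input: the "AI" entry mirrors the tags reported in this issue;
// the "is" entry is illustrative only.
const tagged = [
  { text: 'is', pos: 'AUX', lemma: 'be' },
  { text: 'AI', pos: 'AUX', lemma: 'be' },
];

const fixed = retagUppercaseAI(tagged);
console.log(fixed.map((t) => `${t.text}/${t.pos}/${t.lemma}`).join(' '));
// prints "is/AUX/be AI/PROPN/AI"
```

The check is deliberately case-sensitive, which means a sentence-initial or all-caps "AI'NT" style input would need separate handling.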