-
Hi, there's a problem with the lemmatization of the word "Zoom", which becomes "zoo".
-
Hi
Should I continue reporting the Japanese lemmatization errors I find, or do you have enough to go on for now?
In the screenshot below you can see that おとせる is not the proper lemma fo…
-
Looking across the English treebanks, I've found some inconsistencies in how punctuation tokens are lemmatized:
- ellipsis:
- `...` (`\u002E\u002E\…
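A check like this can be scripted. Below is a minimal sketch, assuming the treebanks are in standard CoNLL-U format (ten tab-separated columns, with FORM, LEMMA and UPOS in columns 2 to 4). The function name `punct_lemma_variants` and the sample rows are invented for illustration and are not taken from any treebank.

```python
from collections import defaultdict


def punct_lemma_variants(conllu_text):
    """Return {form: set_of_lemmas} for PUNCT forms that have more than one lemma."""
    lemmas_by_form = defaultdict(set)
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        form, lemma, upos = cols[1], cols[2], cols[3]
        if upos == "PUNCT":
            lemmas_by_form[form].add(lemma)
    # Keep only the forms whose lemma is not consistent.
    return {form: lemmas for form, lemmas in lemmas_by_form.items() if len(lemmas) > 1}


# Invented sample rows (illustration only): the same form with two different lemmas.
SAMPLE = (
    "1\t...\t...\tPUNCT\t_\t_\t0\tpunct\t_\t_\n"
    "1\t...\t\u2026\tPUNCT\t_\t_\t0\tpunct\t_\t_\n"
    "1\t,\t,\tPUNCT\t_\t_\t0\tpunct\t_\t_\n"
)

for form, lemmas in punct_lemma_variants(SAMPLE).items():
    print(repr(form), sorted(lemmas))
```

To compare treebanks, one could concatenate the text of their `.conllu` files and pass the result to the function; any form it returns is lemmatized inconsistently somewhere in the input.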
-
While editing the corpus I See Your Eagerness, I noticed the following:
- ⲉⲧⲃⲏⲧ doesn't (but should) lemmatize to ⲉⲧⲃⲉ
- ⲉⲙ doesn't (but should) lemmatize to ⲛ
- ⲑⲩⲥⲥⲁⲥⲧⲏⲣⲓⲟⲛ doesn't (but should) lemm…
-
All corpora need to be checked for lemmatization of ⲩⲛⲟⲩ; should be ⲟⲩⲛⲟⲩ. See [this ANNIS search](https://corpling.uis.georgetown.edu/annis/scriptorium#_q=bm9ybT0i4rKp4rKb4rKf4rKpIiBfPV8gbGVtbWE9IuK…
-
May be unnecessary for Release 0.1
-
When defining `create_component`, only `lemmatizer_path` is passed, so there is no way to set `use_plain_lemmatization` on spaCyIWNLP, even though in spaCyIWNLP's constructor we can pass use_p…
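The change being asked for might look roughly like the sketch below. It is only an illustration: `PlainLemmatizerComponent` is a hypothetical stand-in class, not the real spaCyIWNLP, and the `create_component` signature shown here is an assumption rather than the project's actual factory. The only names taken from the report are `create_component`, `lemmatizer_path` and `use_plain_lemmatization`.

```python
class PlainLemmatizerComponent:
    """Hypothetical stand-in for spaCyIWNLP (NOT the real class)."""

    def __init__(self, lemmatizer_path, use_plain_lemmatization=False):
        self.lemmatizer_path = lemmatizer_path
        self.use_plain_lemmatization = use_plain_lemmatization


def create_component(lemmatizer_path, use_plain_lemmatization=False):
    """Assumed factory shape: forward the extra option instead of dropping it."""
    return PlainLemmatizerComponent(
        lemmatizer_path,
        use_plain_lemmatization=use_plain_lemmatization,
    )


# "path/to/lemmatizer.json" is a placeholder path, not a real file.
component = create_component("path/to/lemmatizer.json", use_plain_lemmatization=True)
print(component.use_plain_lemmatization)
```

Keeping the default at `False` in this sketch means existing callers that pass only `lemmatizer_path` would behave as before.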
-
Hi,
First of all, I want to thank you for sharing this app. It's really helpful and impressive work.
I was looking through the code and I've noticed that you have used the Ext JS framework (if I…
-
Good afternoon,
when I try to simply reproduce the lemmatization code from the Israeli ambassador example:
for token in doc.tokens:
    token.lemmatize(morph_vocab)
print(doc.tokens[:5])
{_.text: _.lemma …