-
**Describe the bug**
After having a model for lemmatization trained in `classla-spoken/models/lemma/baseline.pt`, running the usual code for predicting with the model
```
python -m classla.mode…
-
While building an English [lemma validator](https://github.com/rhdunn/conllu-en-validator/blob/master/validator/lemma.py) and using it to check the UD English treebanks, I've identified two cases wher…
-
Hi, we are soon going to release [Annif](https://github.com/NatLibFi/Annif) 1.0. We are currently depending on simplemma 0.9.1, which is the last released version, released on January 20th.
Since t…
osma updated
6 months ago
-
The provided **StandfordCoreNLP-German.properties**
props.setProperty("annotators", "tokenize, ssplit, mwt, pos, ner, depparse");
Will adding lemma to the properties works?
props.setProperty("a…
-
**The problem**
Manticore cannot find document with 'см' or 'км' Russian abbreviations (centimeters and kilometers respectively)
**To Reproduce**
I have rt-index with document, that has the follo…
-
We are using the phrase "root word" as a catchall term. There are many words that are misspellings of a common slur. For instance fuck could be spelled as fck, fk, fcuk etc.
This was highlighted a…
-
**Describe the bug**
Relative path to log not working anymore after update 6.0.4 -> 6.2.0 5.0.2 -> 6.x. Error on startup:
`[17:20.227] [6392] FATAL: failed to open log file '..\log\searchd.log': No …
-
It would be great to have Ancient greek as its not available in the list of languages even though it is already in Wiktionary
-
سلام و عرض ادب. ممنون بابت توسعه این ابزار
من میخواستم پسوند یا همان شناسه یک فعل را پیدا کنم ولی نمیدونستم اصلا امکان این وجود داره یا نه
مثلا برای فعل میرویم، 'یم' بهم داده بشه. چند تا مثال دیگه …
-
Hello,
I realized that WordNetLemmatizer.lemmatize() would lemmatize word "us" to "u", which is not ideal most of the time. Maybe this issue could be fixed?
Best,
BW