-
The parser tests for some ATF features are missing meaningful assertions and therefore pass trivially.
This is because these tests were initially skipped while the relevant features were not suppor…
-
## How to reproduce the behaviour
import spacy
nlp = spacy.load('de')
s1 = 'Der schöne Garten' …
-
Hi. Here is an issue I'm getting using some French pipelines (fr_core_news_lg or fr_dep_news_trf).
As you can see, it works in some cases but fetches the wrong lemma in others.
So far I've …
-
Among open issues, we have (not an exhaustive list):
- #135 complains about the sentence tokenizer
- #1210, #948 complain about word tokenizer behavior
- #78 asks for the tokenizer to provide offsets …
-
- Among predicates ending in '-럽다', there are cases where the '-워' is omitted.
- 간지럽다 + 워 -> 간지러워 -> ('워' omitted) -> 간지러
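The derivation above can be sketched in a few lines of Python. This is only an illustration of the rule as described here, not the lemmatizer's actual implementation: the function name `conjugate_reopda` and the `drop_wo` flag are hypothetical, and the sketch assumes the input is a dictionary form ending in '다' whose stem ends in a syllable with final consonant ㅂ.

```python
import unicodedata

HANGUL_BASE = 0xAC00      # code point of the first precomposed syllable '가'
HANGUL_LAST = 0xD7A3      # code point of the last precomposed syllable '힣'
JONGSEONG_COUNT = 28      # number of final-consonant slots (including "none")
BIEUP_INDEX = 17          # slot index of the final consonant ㅂ


def final_consonant_index(syllable: str) -> int:
    """Return the final-consonant slot of a precomposed Hangul syllable."""
    code = ord(syllable)
    if not HANGUL_BASE <= code <= HANGUL_LAST:
        raise ValueError(f"not a Hangul syllable: {syllable!r}")
    return (code - HANGUL_BASE) % JONGSEONG_COUNT


def drop_final_consonant(syllable: str) -> str:
    """Remove the final consonant from a syllable, e.g. '럽' -> '러'."""
    return chr(ord(syllable) - final_consonant_index(syllable))


def conjugate_reopda(lemma: str, drop_wo: bool = False) -> str:
    """Hypothetical helper: build the '-워' form of a ㅂ-final predicate,
    optionally omitting '워' as in the example above."""
    lemma = unicodedata.normalize("NFC", lemma)
    if len(lemma) < 2 or not lemma.endswith("다"):
        raise ValueError("expected a dictionary form ending in '다'")
    stem = lemma[:-1]                       # '간지럽다' -> '간지럽'
    if final_consonant_index(stem[-1]) != BIEUP_INDEX:
        raise ValueError("stem does not end in the final consonant ㅂ")
    base = stem[:-1] + drop_final_consonant(stem[-1])   # '간지러'
    return base if drop_wo else base + "워"


print(conjugate_reopda("간지럽다"))                # full form with '워'
print(conjugate_reopda("간지럽다", drop_wo=True))  # form with '워' omitted
```

A lemmatizer would need the inverse mapping (from 간지러 back to 간지럽다), which means treating the shortened surface form as a candidate for the same lemma as the full '-워' form.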
lovit updated
5 years ago
-
### Description
Apache OpenNLP functionality has been available in Lucene starting with [v7.3.0](https://lucene.apache.org/core/7_3_0/analyzers-opennlp/index.html). Based on a request from one of my…
-
Manticore Search 3.6.0 added integration with pymorphy2 as part of the [Lemmatizer for Ukrainian](https://github.com/manticoresoftware/lemmatizer-uk). Does it make sense to mention it somewhere in …
-
Pymorphy3 uses a much more recent version of Ukrainian dictionaries (VESUM).
https://github.com/no-plagiarism/pymorphy3
-
The initial run of the script is slow because it has to load the following dependencies and initialize the Lemmatizer and Stemmer:
```
from nltk.corpus import wordnet as wn
from nltk.stem import …