-
After upgrading from version 2.2.2 to 2.3.0 one of our existing jobs breaks due to a `ClassCastException` when running a Tokenizer that feeds into a Lemmatizer. See error description below.
## Desc…
-
## How to reproduce the behaviour
doc = nlp(('Hallo, ik ben Piet. Ik heb gisteren iets gekocht.'))
for w in doc:
print(w.text, w.pos_, w.lemma_)
The result is:
hallo X hallo
, PUNCT ,
ik PR…
-
## How to reproduce the behaviour
Hi,
I want to add a new language to "spacy-lookups-data", I already made a necessary changes on setup.cfg, setup.py and __init__.py files, and a new generated az_…
-
Add text parsing logic for parsing dataset, and retrieving commodity from vision API
-
Would be great if we could have stemmers addition to "other" section to reduce dimensionality of NLP. Cutting down some wasted memory/processing time from things like plurals and generating stronger l…
-
I installed ginza.
```
pip3 install "https://github.com/megagonlabs/ginza/releases/download/latest/ginza-latest.tar.gz"
~~~
Successfully installed SudachiDict-core-20190531 SudachiPy-0.4.0 blis-…
-
Hello all,
I am looking to extend the croatian language model in spaCy with a look-up lemmatizer.
I found a great source of lemmas (over 100 000 lemmas in more than a million forms) here : http://…
-
Not sure if this is an issue or not, but there are several files in the Norwegian (`nb`) lemmatizer directory that are basically empty. Here's an example - this is the whole file:
```
# coding: ut…
-
I am trying to lemmatize text but I keep running into this issue even though I am not using pos-tagging:
`Caused by: `java.io.IOException: Unable to open` "edu/stanford/nlp/models/pos-tagger/englis…
-
Greetings!
I'm not sure whether it's from my inexperience with PyTorch or conll (if it's even relevant), but I'm having trouble with understanding the input/output files necessary in training a new…