-
I use spaCy as a dependency for my own Python package (in case anyone's interested: lytspel). So I've added 'spacy' to the install_requires section of my setup.py. This basically works, but there are …
-
In the example below, modelling the entry with 2 senses, differentiated by POS, we'll lead us to the same issue as in #43 where we need an \ inside \ which is not a valid TEI option and looks a bit w…
-
## How to reproduce the behaviour
doc = nlp(('Hallo, ik ben Piet. Ik heb gisteren iets gekocht.'))
for w in doc:
print(w.text, w.pos_, w.lemma_)
The result is:
hallo X hallo
, PUNCT ,
ik PR…
-
Hey, thanks for this great project! I have been using the entity linking (ner) component a lot for English.
I am wondering if it is possible to run the whole wikiflow for languages not in the 12 la…
-
I added a few phrases for Anlagenbedienung/Anlagentechniker as a second corpus infile.
It then shows up correctly in the test data (labeledmorph_ger.csv):
```
4 Anlagenbedienung Anlagenbedienung …
-
Hi,
Is-it possible to train simultaneously the lemma tagger and the POS tagger ? By playing with the suggested model, I have never succeeded.
How should we format the data to use the "POS taggin…
-
- Generate a full dictionary with N-Grams (Probably only up to two)
- Remove all *Stop words* from the dictionary
- Lemmatize the whole thing with NLTK
- Create some custom Lemmatization such as yo…
-
While performing experiments on lemmatization with extended Slovene and Croatian datasets (continuation of work that resulted also in #18), I noticed that the POS information is not picked up by the s…
-
I am not sure if its an issue with wordnet corpus or with nltk or my way of using it, but I've found one case where I am not getting expected result:
~/nltk_data/corpora/wordnet/noun.exc contain this …
-
Hi,
I have tried this :
```
{
"modelname": "lemmatization-latin",
"modelpath": "models",
"input_path": "datasets/Caesar_BellumGallicum.tsv",
"dev_path": "datasets/Caesar_BellumCi…