-
When merging two tokens, the text attribute of the new token is tokena.text + ' ' + tokenb.text. I would much prefer the ability to specify a new text label such as tokena.text + '-' + tokenb.text or …
ghost updated
6 years ago
-
When merging two tokens, the text attribute of the new token is `tokena.text + ' ' + tokenb.text`. I would much prefer the ability to specify a new text label such as `tokena.text + '-' + tokenb.text`…
-
HI,
So I noticed there are a lot of German language related parts in the code. But when I try to initialize
```
from spacy.de import German
nlp = German()
```
I get
model 'de>=1.0.0,
-
Go to rinap/rinap5 and open, for example, Asb004.atf.
In Aquamacs, run `Check ATF`.
The following results are displayed:
`atf-check`
`Asb004.atf:104: (lem) no FORM e-liti; 1 match for elīti[upper]…
-
From @gregorycrane:
> https://lk353.eu1.eldarioncloud.com/reader/urn:cts:greekLit:tlg0011.tlg003.perseus-grc2:55/
>
> the vocab does not line up -- i was searching for erxomai but I don't see it…
-
I've been looking at the Russian-SynTagRus data, and there is a choice made in selecting the lemma, that aspect seems to be treated as an inflectional feature. So, the lemma of the perfective verb is …
-
Hey,
I noticed something weird when finding the lemma_ of tokens.
When I find the lemma_ for the token for 'cakes': `nlp("cakes")[0].lemma_`, I get what is expected: 'cake'.
The same thing appli…
-
Would it make sense to also add the udpipe r package as a backend. That package also has no external dependencies and provides tokenisation, lemmatisation, pos tagging, feature tagging and dependency …
-
`Token.lemma` right now returns only a single lemma, as far as I understand.
As you know there are always words in a language that are "clashing" surface forms of different lemmata.
_coating -> coati…
-
We might start with the Tufts Core MA reading list:
https://ase.tufts.edu/classics/graduate/readingLists.htm
What texts have individuals read?
How much vocab is in those texts?
What vocab is new…