-
**To Reproduce**
Steps to reproduce the behavior:
```
import stanza
stanza.download('pt')
stnz = stanza.Pipeline('pt', use_gpu=False)
text = stnz("convido-os a levantarem-se para um minuto de si…
-
There are a number of codes (references to documents, mainly) which degrade automatic linguistic preprocessing (e.g. tokenization, lemmatization, PoS, sentence splitting...).
An idea would be to an…
-
I think this is just an error. The two lemmas are unrelated in meaning, and φυλάζω, to divide, doesn't seem to exist in Homer. Cunliffe uses this as his first example of φυλάσσω.
-
## How to reproduce the behaviour
```
import spacy
nlp = spacy.load("de_dep_news_trf")
assert nlp("Du ißt Äpfel")[1].lemma_ == 'essen'
print(nlp("Du isst Äpfel")[1].lemma_)
```
This prints `i…
-
This needs to be documented (and linked to from monarch technical stack docs)
Basic summary:
the annotator is generic (compared to a fine-grained annotator like biolark that is specific to HPO).…
-
For example, "quickly" is not reduced to "quick."
It looks like there are lemma files for nouns and verbs, but not for adjectives. Is there a resource for english adjective lemmatization that could b…
-
We should standardize these and enforce in the validator. As is, e.g. "its" is sometimes lemmatized as "it".
The UD lemmatization policies have evolved and are summarized [here for pronouns](https:…
-
Reference: https://github.com/monarch-initiative/mondo-ingest/issues/112#issuecomment-1329314132
- roman numerals: A function call to convert arabic to Roman numerals (or maybe vice versa? )
- sto…
-
- [ ] Do word to word and phrase to phrase separate files
- [ ] Divide multiple sentences into smaller chunks, may be sentence-wise ?
- [ ] Convert the Jupyter notebook code into pure Python code ?
- …
-
Lemmatization tests on Travis CI succeeded on [Windows](https://travis-ci.com/github/BLKSerene/Wordless/jobs/381823626) and [Linux](https://travis-ci.com/github/BLKSerene/Wordless/jobs/381823628), but…