-
### Description
Apache OpenNLP functionality has been available in Lucene starting with [v7.3.0](https://lucene.apache.org/core/7_3_0/analyzers-opennlp/index.html). Based on a request from one of my…
-
Hello world,
Please, I need your help preparing data.txt for use in skip-gram training.
How can I prepare the text?
What do you think about preprocessing the text (lowercasing, stemming, lemmatization …
-
Can I use your program as a stemmer and/or lemmatizer for the Kazakh language?
-
**Is your feature request related to a problem? Please describe.**
Pluralizing English words is useful for many things:
- Mapping a codebase more easily to collections, database tables, etc.
- e.g.…
-
https://github.com/vgorman1/Greek-Dependency-Trees
kasev updated 3 years ago
-
I think a word can't be marked both RA if it is the relative pronoun, right?
100213 RA ----NPM- οἵ οἵ οἵ ὅς
-
Asia
ATV
Seadoo
Christmas
bmx
jeep
---
African
Arabic
frisbee considered proper by PWN
-
Although the stream is read correctly (and lemmatization works), the following is reported on the console:
_Exception when deserializing Lemmatizer: System.IO.EndOfStreamException: Unable to read bey…
-
Currently, there is no way in the UD English treebanks to differentiate between adjectives that refer to common nouns and those that refer to proper nouns -- both are annotated as `ADJ+JJ`.
This ma…
-
**Background**
- The literature seems unclear on which similarity metrics perform best for diversity and relevancy. (If anyone has found a good analysis of this, it would be great to see.)
- bm25 wor…