lemmatisation Search Results

246 results
for lemmatisation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

msg-systems/coreferee #31

Adding support for trf models without NER

Hello, As I am approaching the end of my enterprise to add support for french, I notice that the performance of the coreference resolution is a lot hampered by the performance of the used spacy mod…

Pantalaymon updated 2 years ago
2
explosion/spaCy #8705

Danish trf wordpiece tokenisations strips accent and lack of…

## How to reproduce the behaviour The Danish transformer strip accent leading to the same wordpieces of meaningfully different words. ``` >>> import spacy >>> nlp = spacy.load('da_core_news_tr…

KennethEnevoldsen updated 3 years ago
7
biblissima/collatinus #67

petites erreurs dans lemmes.la et lem_ext.la

supprimer les parenthèses ouvrantes dans le participe et les génétifs, et le point d'exclamation dans le lemme ``` concustōdĭo=cōncūstōdĭo|audio|cōncūstōdīv|cōncūstōdī(|is, ire, iui, itum|2 Aether2…

mcorne updated 2 years ago
4
explosion/spaCy #7347

Spacy 3.0 no longer assigns expected lemmas to contractions …

Spacy 2.3, `en-core-web-lg` “I can't go”: (orth / lemma) ``` I / -PRON- ca / can n't / not # HERE go / go ``` Spacy 3.0.2, `en-core-web-lg` ``` I / I ca / ca # HERE n't / n't # HERE…

adam-ra updated 3 years ago
3
eubinecto/idiomatch #9

The lemmas of idioms should not contain whitespaces

## The problem As of right now, the lemmatised string of idioms contains whitespaces (except the hyphenated ones) like so: ``` tokenisation: ['You', 'are', 'down to earth', '.'] lemmatisation: […

eubinecto updated 3 years ago
1
stanfordnlp/stanza #701

Wrong Dutch lemmatisation even if not present in training se…

I was doing some basic parsing tests and found that a very mundane word was lemmatised incorrectly. The Dutch word eten ("to eat") is incorrectly lemmatised as "emmen" when given in its singular form.…

BramVanroy updated 3 years ago
6
oracc/nisaba #33

Investigate logging options

See also #26, putting here for more focused discussion. We may need to use different logging mechanisms to show information to users and to store for later debugging. - What do we need to log and …

ageorgou updated 3 years ago
1
clarin-eric/ParlaMint #102

SI: Lemmas for hyphenated words are wrong

Lemmas for hyphenated words replace the hyphen with a "v". I.e. "gozdno-lesen" --> "gozdnovlesen" (see document ParlaMint-SI_2020-06-15-SDZ8-Redna-18). This is a common occurrence in the Slovenian…

ajdapretnar updated 3 years ago
3
TEIC/TEI #2113

Spelling under 3.8.2.2 in Guidelines

Above the first Arabic lemmatisation example: supplied by an term elements → a term element, I presume.

Dominique-M updated 3 years ago
1
obsei/obsei #75

Add text cleaner node

Idea to have configurable text cleaning node. This node also have predefined template to clean tweets, facebook feed, app reviews etc. For detail refer https://github.com/lalitpagaria/obsei/issues…

lalitpagaria updated 3 years ago
4

上一页 1...10 11 12 13 14 15 16...25 下一页

246 results for lemmatisation

246 results
for lemmatisation