lemmatization Search Results

1000+ results
for lemmatization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ljvmiranda921/calamanCy #31

Lemmatizer

Hi, is there something like a lemmatizer? I have a couple of tagalog sentences with translations and I am trying to lemmatize them (then do some sorting by frequency and then use it myself for languag…

wadid updated 11 months ago
3
Ejhfast/empath-client #4

Stemming or Lemmatize?

Hi, Is there a way to incorporate stemming or lemmatization? The problem is, for example, while the word 'help' in a text gets counted towards category help, the words helping and helped do not. An…

santoshbs updated 4 years ago
1
dgarrick/headliner #38

Merge clusters like "trumps" if "trump" exists. Otherwise do…

campbellcompton updated 6 years ago
3
dotnet/machinelearning #5281

Request : Apply Lemma / stemming in FeaturizeText options

Hi First Thank you for all the work done, i know that FeaturizeText apply NLP preprocessing like skipword with a specifique language : ![image](https://user-images.githubusercontent.com/16559628/86…

ErwanL08 updated 11 months ago
10
mmcs-ruby/sentiment #5

Find/Create function for text tokenzation

Find/Create function for sepatating text string to tokens(words). Function must get text string and return list of string tokens. Function also **should not** return tokens containing digits and pu…

AndreyKondakovGW updated 2 years ago
4
HaraHeique/TCC-rede-neural-siamesa #34

Criar 3 versões de dataset tanto para treinamento quanto par…

Versões: 1. Datasets limpos (sem lematização e sem remoção de SW); 2. Datasets com remoção de SW; 3. Datasets com remoção de SW e lematização (ambos usando NLTK como é feito atualmente). [Trea…

HaraHeique updated 3 years ago
1
cltk/cltk #1194

A way to tell what tokens `LatinBackOffLemmatizer()` has fai…

In `LatinBackOffLemmatizer()` and the lemmatizers in its chain I can't seem to find an option to return an empty value (such as in `OldEnglishDictionaryLemmatizer()`'s `best_guess=False` option), inst…

langeslag updated 1 year ago
6
NatLibFi/Annif #539

Stanza analyzer

Just like with spaCy (see #374), we could add an analyzer that uses the [Stanza](https://stanfordnlp.github.io/stanza/) (formerly StanfordNLP) NLP toolkit for tokenization and especially lemmatization…

osma updated 2 years ago
1
ziqizhang/jate #49

Example SOLR configuration for German text corpus?

For somebody not familiar with SOLR it is very hard to start using this. Would it be possible to add an example configuration for processing a corpus where each document is just a text file for the …

johann-petrak updated 4 years ago
1
avashishta5/Sankshep #15

Lemmatizer Enhancements

- [ ] Add POS Tagging to exclude nouns from lemmatization and for better sanitization. - [ ] Replace regular Levenshtein distance with a Levenshtein Automaton + Jaro-Winkler Distance based approa…

avashishta5 updated 4 years ago
1

上一页 1...8 9 10 11 12 13 14...100 下一页

1000+ results for lemmatization

1000+ results
for lemmatization