lemmatization Search Results

1000+ results
for lemmatization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

explosion/spaCy #10953

Problems and errors in new German lemmatizer (since 3.3.0)

For some context, here was the master issue for problems in lemmatization for the lookup-based lemmatizer for German: https://github.com/explosion/spaCy/issues/2486 And here was the announcement that …

lutz-100worte updated 1 year ago
6
CIRCSE/LEMLAT3 #15

Grouping identical analyses into a single entry

Create a filter/function to group identical analyses into a single entry. For example, analyses `18` and `19` of `forma` (Du Cange) are identical: ``` ============================ANALYSIS 18======…

gfranzini updated 5 years ago
1
manticoresoftware/manticoresearch #707

Looks like wordforms don't indexed right with index_exact_wo…

**Describe the bug** According to the [documentation](https://manual.manticoresearch.com/Creating_an_index/NLP_and_tokenization/Morphology#index_exact_words) and Manticoresearch team comments, opti…

asegrenev updated 1 year ago
1
UniversalDependencies/docs #994

Annotations for adjectives referring to proper nouns vs comm…

Currently, there is no way in the UD English treebanks to differentiate between adjectives that refer to common nouns and those that refer to proper nouns -- both are annotated as `ADJ+JJ`. This ma…

rhdunn updated 5 months ago
15
chartbeat-labs/textacy #323

Return keyterm positions in original document when performin…

### context I'm looking to get the original token positions of keyterms when performing keyterm extraction with e.g. TextRank, but this can apply to the other extractors. Example: ```python >>> d…

ChrisJBlake updated 3 years ago
2
qanastek/ANTILLES #1

verbs ending in `-issions` tokenized incorrectly

When using a model like `qanastek/pos-french-camembert`, a verb such as `finissions` results in multiple tokens with VERB entities like `["fini" VERB", "ssions" VERB]`. This does not happen with the f…

joprice updated 3 months ago
3
Brown-University-Library/OLD-ARCHIVED_iip-production #136

Make Word Lists work with new Latin data

The Word List part of the IIP website is generated by code that is in this repository (iip-production). There is a `wordlist.html` template in the templates directory: https://github.com/Brown-Unive…

emylonas updated 2 years ago
6
azuline/repertoire #55

frontend: search feature

hmm i dont want to write, i write enough at work. so im gonna pull together some unified search table and attach a labeling system. labels will be able to filter search results--this is a data-agnosti…

azuline updated 3 years ago
2
suhasbhairav/JuliaMachineLearning #7

train a model on a lemmatized corpus

1. lemmatize the ukwac+wacky corpus using Jobimify tool: ``` frink:/home/panchenko/jobimify ``` - use the concatenation of these corpora http://cental.fltr.ucl.ac.be/team/~panchenko/d…

alexanderpanchenko updated 9 years ago
13
nltk/nltk #3256

Add functionality to return the lemmas of words used in a co…

I have been working with natural language processing and often needed to know which words were used in certain corpora. Many dictionaries are comprised of word stems, requiring the extraction of stems…

Sion1225 updated 5 months ago
1

上一页 1...13 14 15 16 17 18 19...100 下一页

1000+ results for lemmatization

1000+ results
for lemmatization