lemmatization Search Results

1000+ results
for lemmatization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ontoportal-lirmm/ncbo_annotator #4

Normalize the annotator input and dictionnary with TreeTager

This task consists in using TreeTager to normalize the text being sent to the Annotator and therefore also use it to normalize the content of the dictionary. This task is divided into 3 specific iss…

jonquet updated 4 years ago
18
jwzimmer-zz/aboutvsof #2

what does the language in a conspiracy theory conference loo…

Re https://github.com/jwzimmer/aboutvsof/issues/1#issuecomment-758165911, there may be a pretty simple way to tell conspiracy theory speech (I am not sure what to even call this... I mean "nonsense"?)…

jwzimmer-zz updated 3 years ago
18
JetBrains-Research/pubtrends #26

Subtopics naming - stemming + correct n-gramms processing

Most of the time the naming is like the following:

olegs updated 4 years ago
2
bnosac/udpipe #59

plot dependency parsing

``` library(udpipe) library(igraph) library(ggraph) library(ggplot2) plot_annotation

jwijffels updated 4 years ago
1
mlopatka/CANOSP2020 #69

Harmonize text preprocessing workflows

Each of the different exploratory directions are using a lot of manual regex-based data cleaning. We should consolidate that code into a utils file and use consistent prepossessing across all our dif…

mlopatka updated 4 years ago
7
crosswire/xiphos #990

Misleading proximity of Strong's tags to the next line

When (e.g.) **Strong's numbers** are enabled in module settings, the numbers are closer to the next line than to the line they actually belong to. This is misleading to some extent. Enabling **D…

DavidHaslam updated 4 years ago
6
UniversalDependencies/UD_Portuguese-Bosque #4

Differences in tokenization and POS

We need to train treeler in a way that matches the way Freeling emits the tokens, in terms of tokenization, lemmatization, and POS tagging. Otherwise we will train for something that will never appe…

fcbr updated 4 years ago
1
Ciphey/Ciphey #90

Language detection interface

### Problem Whilst the original language checker is absolutely brilliant, it fails at small ciphertexts, or those with high entropy. An AI solution would be cool, but would be a bit OTT for rigid dat…

Cyclic3 updated 4 years ago
14
WheatonCS/Lexos #1013

Scrubber form values are not submitted

From what I can tell, the form object in Scrubber is not submitted at all, so it is just running defaults. This is obvious if you uncheck "Make Lowercase". It also explains issue #1011 and issue #1005…

scottkleinman updated 4 years ago
6
explosion/spaCy #5095

LEMMA 'learn' doesn't match with `learning`

## How to reproduce the behaviour ```python import spacy from spacy.matcher import Matcher nlp = spacy.load('en_core_web_sm') matcher = Matcher(nlp.vocab) pattern = [{"LEMMA": "learn"}] mat…

chingan-tsc updated 4 years ago
4

上一页 1...93 94 95 96 97 98 99...100 下一页

1000+ results for lemmatization

1000+ results
for lemmatization