opener-project / pos-tagger-en-es

POS tagger for English, Spanish, Dutch and other languages.

Lemmatization error #3

Open DVD27 opened 8 years ago

DVD27 commented 8 years ago

Hello,

I have a problem with a French KAF document:

<KAF xml:lang="fr" version="v1.opener"><kafHeader><fileDesc/><linguisticProcessors layer="text"><lp name="opener-sentence-splitter-fr" version="0.0.1" timestamp="2016-02-11T07:21:24Z"/><lp name="opener-tokenizer-fr" version="1.0.1" timestamp="2016-02-11T07:21:24Z"/></linguisticProcessors></kafHeader><text><wf wid="w1" sent="1" para="1" offset="0" length="2">je</wf><wf wid="w2" sent="1" para="1" offset="3" length="2">n'</wf><wf wid="w3" sent="1" para="1" offset="5" length="4">aime</wf><wf wid="w4" sent="1" para="1" offset="10" length="3">pas</wf><wf wid="w5" sent="1" para="1" offset="14" length="3">les</wf><wf wid="w6" sent="1" para="1" offset="18" length="6">crêpes</wf></text></KAF>

When I use the OpeNER pos-tagger web service, I obtain: `jen'aimepaslescrêpes`

The lemma for "aime" is wrong: it should be "aimer", but instead we get "aimer___love__1". This is strange.
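For anyone trying to reproduce this, here is a minimal sketch of how such lemmas could be spotted automatically. It assumes the tagger output carries a KAF `terms` layer whose `term` elements have a `lemma` attribute. The `SAMPLE` fragment and the helper name `suspicious_lemmas` are hypothetical and only illustrate the check; the one value taken from this report is the lemma "aimer___love__1".

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment of tagger output, written by hand for illustration.
# Only the lemma "aimer___love__1" comes from the report above.
SAMPLE = """<KAF xml:lang="fr" version="v1.opener">
  <terms>
    <term tid="t3" lemma="aimer___love__1"><span><target id="w3"/></span></term>
    <term tid="t4" lemma="pas"><span><target id="w4"/></span></term>
  </terms>
</KAF>"""


def suspicious_lemmas(kaf_xml, marker="___"):
    """Return (term id, lemma) pairs whose lemma contains the marker string."""
    root = ET.fromstring(kaf_xml)
    return [
        (term.get("tid"), term.get("lemma"))
        for term in root.iter("term")
        if marker in (term.get("lemma") or "")
    ]


if __name__ == "__main__":
    for tid, lemma in suspicious_lemmas(SAMPLE):
        print(tid, lemma)
```

Running the same function over the real web service response would list every term whose lemma has the extra "___" suffix, which may help narrow down whether the problem affects only some words.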