apertium / apertium-grn

Apertium linguistic data for Guarani
GNU General Public License v3.0
2 stars 2 forks source link

Correct spelling of "kuérape" has the wrong weight #25

Open ftyers opened 5 years ago

ftyers commented 5 years ago
$ echo "kuérape" | hfst-lookup grn.autoseg.hfst 
kuérape kuerá>pe    1,000000
kuérape kuéra>pe    2,000000

The reason is:

$ echo "kuérape" | hfst-lookup grn.automorf.hfst 
kuérape kuera<adj>+pe<post> 0,000000
kuérape kue<n>r<det>+tape<n>    1,000000
kuérape kuera<adj>+pe<post> 1,000000
kuérape kuera<v><iv>+pe<post>   1,000000
kuérape kuera<v><iv><pres>+pe<post> 1,000000
kuérape kuéra<det><pl>+pe<post> 1,000000

Which in turn comes from:

^,/,<cm>$
^ko/ko<adj><dem><sg>$
^karu/karu<n>$
^guasurã/guasu<adj>+rã<subs>$
^,/,<cm>$
^ajapóva/a<prn><p1><sg><ac>+japo<v><tv><pres>+va<subs><dem>$
^Israelgua/Israel<np><top>+gua<post>$
^kuérape/kuera<adj>+pe<post>$

Is this the right analysis? If so, perhaps we should include some examples of kuérape in the corpus.