giellalt / bugzilla-dummy

0 stars 0 forks source link

Test words are not recognised/accepted (Bugzilla Bug 2031) #542

Open albbas opened 9 years ago

albbas commented 9 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 2031

Date: 2015-04-22T12:01:59+02:00 From: Sjur Nørstebø Moshagen <> To: Thomas Omma <> CC: borre.gaup, sjur.n.moshagen, trond.trosterud

Last updated: 2018-05-29T10:51:13+02:00

albbas commented 9 years ago

Comment 10441

Date: 2015-04-22 12:01:59 +0200 From: Sjur Nørstebø Moshagen <>

When running the following test command:

$ pushd $GTHOME/langs/sme/test/tools/spellcheckers/fstbased/hfst; /opt/local/bin/python3.3 $GTHOME//gtcore/scripts/morph-test.py -c -i -v -S hfst --app "/usr/local/bin/hfst-optimized-lookup" --morph ././../../../../../tools/spellcheckers/fstbased/hfst/acceptor.default.hfst --surface ././../../../../../test/tools/spellcheckers/fstbased/hfst/hfst-acceptor-yamls/words_acceptor.default.speller.yaml; popd

12 input words are not accepted:

ANC-reahccut masti-NRK:s st.dieđ-ravgalastin st.dieđ-láikkesbiro guovttenuppelotčoarvvagiin Guovdageainnu-Romssa guovttenuppelotnamagiid goađátcielastuvvanbárteuvsa doppe-garradastimis allonbiertnaiguin anti-dábálaččaide elge

This is the present behavior of the gt-norm analyser:

$ lookup -q src/analyser-gt-norm.xfst ANC-reahccut ANC-reahccut ANC +N+ACR+Cmp-#reahccut+V+IV+Der/NomAg+N+Pl+Nom

masti-NRK:s masti-NRK:s mastat +V+IV+Der/NomAg+N+SgGenCmp+Cmp-#NRK+N+ACR+Sg+Loc masti-NRK:s mastat +V+IV+Der/NomAg+N+SgNomCmp+Cmp-#NRK+N+ACR+Sg+Loc

st.dieđ-ravgalastin st.dieđ-ravgalastin st.dieđ-ravgalastin +?

st.dieđ.-ravgalastin st.dieđ.-ravgalastin st.dieđ.-ravgalastin +?

guovttenuppelotčoarvvagiin guovttenuppelotčoarvvagiin guoktenuppelohkái+Num+SgGenCmp+Cmp#čoarvi+N+Der/t+A+Sg+Com guovttenuppelotčoarvvagiin guoktenuppelohkái+Num+SgGenCmp+Cmp#čoarvi+N+Der/t+A+Pl+Loc

Guovdageainnu-Romssa Guovdageainnu-Romssa Guovdageainnu-Romssa +?

guovttenuppelotnamagiid guovttenuppelotnamagiid guoktenuppelohkái+Num+SgGenCmp+Cmp#namma+N+Der/t+A+Pl+Acc guovttenuppelotnamagiid guoktenuppelohkái+Num+SgGenCmp+Cmp#namma+N+Der/t+A+Pl+Gen

goađátcielastuvvanbárteuvsa goađátcielastuvvanbárteuvsa goađátcielastuvvanbárteuvsa +?

doppe-garradastimis doppe-garradastimis doppe +Adv+Hyph+Cmp#garradit+V+TV+Der/asti+V+Der/NomAct+N+Sg+Gen+PxSg3 doppe-garradastimis doppe +Adv+Hyph+Cmp#garradit+V+TV+Der/asti+V+Der/NomAct+N+Sg+Acc+PxSg3 doppe-garradastimis doppe +Adv+Hyph+Cmp#garradit+V+TV+Der/asti+V+Der/NomAct+N+Sg+Loc

allonbiertnaiguin allonbiertnaiguin allonbiertnaiguin +?

anti-dábálaččaide anti-dábálaččaide anti +N+Hyph+Cmp#dáhpi+N+Der/laš+A+Pl+Ill

elge elge elge +?

albbas commented 9 years ago

Comment 10451

Date: 2015-04-23 10:10:25 +0200 From: Thomas Omma <>

ANC-reahccut masti-NRK:s st.dieđ-ravgalastin ----- changed: mdb-ravgalastin st.dieđ-láikkesbiro ----- changed: mdb-láikkesbiro guovttenuppelotčoarvvagiin ----------- removed, cause derivation has been marked -Spell Guovdageainnu-Romssa --------- this was treated in PLX guovttenuppelotnamagiid ----------- removed, cause derivation has been marked -Spell goađátcielastuvvanbárteuvsa ----- changed: goađátcielastuvvanbárteuvssa doppe-garradastimis allonbiertnaiguin --------- this was treated in PLX anti-dábálaččaide elge ----- changed: elege

don't know why the others don't function