giellalt / lang-sme

Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Northern Sami language
https://giellalt.uit.no
GNU General Public License v3.0
6 stars 1 forks source link

Missing mwe disambiguation for `oaččuiba`, error not detected #76

Open snomos opened 11 months ago

snomos commented 11 months ago

Input:

Jođiheaddji guovttosges oaččuiba jo ánsoruossa, mii lea čuovvovaš ánsu maŋŋá gollemedálja.

Command:

echo 'Jođiheaddji guovttosges oaččuiba jo ánsoruossa, mii lea čuovvovaš ánsu maŋŋá gollemedálja.' | \
./tools/grammarcheckers/modes/smegramrelease.mode

Result:

WARNING: Line 6: Some but not all main-readings of "<oaččuiba>" had wordform-tags (not completely mwe-disambiguated?), not splitting.
"<oaččuiba>"
    "oažžut" <mv> V <TH-Inf> <TH-Acc-Any><mielde> <TH-Acc-Any><árvvus> <TH-Acc-Any><mátkái> <TH-Acc-Any><fápmui> <TH-Acc-Any><doibmii> <TH-Acc-Any><johtui> <TH-Acc-Any><OR-Loc-Any> <TH-Acc-Any><SO-Loc-Any><DE-Ill-Any> <TH-Acc-Any><DE-Ill-*Ani> <AG-Acc-Ani><TH-Inf> <TH-Inf> <TH-Acc-Any><MA-Ill-áigi> <TH-Acc-Any> <Inf> TV Ind Prt Du3 Err/Orth <W:0.0> @+FMAINV #3->3
    "ba" Pcle Foc/ba <W:0.0> "<ba>" @PCLE #3->3
        "oažžut" V <Inf> TV Ind Prt Sg3 <W:0.0> "<oaččui>" #3->3

Ie a grammar error goes undetected because we are left with an ambiguous token.