giellalt / lang-sme

Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Northern Sami language
https://giellalt.uit.no
GNU General Public License v3.0
6 stars 1 forks source link

compound splitting does not work (Bugzilla Bug 2702) #442

Open albbas opened 4 years ago

albbas commented 4 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 2702

Date: 2020-11-03T13:35:59+01:00 From: Linda Wiechetek <> To: Tommi A Pirinen <> CC: linda.wiechetek, sjur.n.moshagen, trond.trosterud, unhammer+apertium

Last updated: 2020-11-09T14:10:41+01:00

albbas commented 4 years ago

Comment 14109

Date: 2020-11-03 13:35:59 +0100 From: Linda Wiechetek <>

I'm not sure where in the pipeline the error is, so I chose grammarcheckers as the category. When analyzing the following sentence, "oahppan giela" gets analyzed as a potential compound. The compound reading with an Err/Space tag is removed, but the compound is not split!! Why is that so? Usually there are two alternatives for potential compounds:

  1. it is analyzed as one word with Err/Space
  2. it is split

Here neither of those is the case. I'm lost.

Máhtten unnán dárogiela dalle go evakuerejuvvuimet, muhto ledjen juo oahppan giela oalle bures dáluin gos leimmet ovdal go álgen skuvlii.

"" "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppat" V TV PrfPrc "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppat" V TV Ind Prt ConNeg "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppat" V TV Actio Nom "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppat" V TV Actio Gen "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppat" Ex/V TV Der/NomAct N Sg Nom "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppat" Ex/V TV Der/NomAct N Sg Gen "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppan" N Sem/Act Sg Nom "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppan" N Sem/Act Sg Gen Allegro "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppa" N Sem/Edu Sg Nom PxSg1 "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppa" N Sem/Edu Sg Gen PxSg1 "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppa" N Sem/Edu Sg Acc PxSg1 "" giellalt/bugzilla-dummy#10->10 "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 "oahppa" N Sem/Edu Ess "" giellalt/bugzilla-dummy#10->10 ; "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 ; "oahppa" N Sem/Edu Sg Gen Err/Orth PxSg1 "" giellalt/bugzilla-dummy#10->10 REMOVE:8599:SuperfluousErrTags ; "giella" N Sem/Lang_Tool Sg Acc "< giela>" @ giellalt/bugzilla-dummy#10->10 ; "oahppa" N Sem/Edu Sg Acc Err/Orth PxSg1 "" giellalt/bugzilla-dummy#10->10 REMOVE:8599:SuperfluousErrTags ; "oahppangiella" N Sem/Lang Sg Acc Err/SpaceCmp SELECT:3681:r683 ; "oahppangiella" N Sem/Lang Sg Gen Allegro Err/SpaceCmp REMOVE:3677 ; "oahppangiella" N Sem/Lang Sg Gen Err/SpaceCmp SELECT:3681:r683 ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppat" V TV PrfPrc "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppat" V TV Ind Prt ConNeg "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppat" V TV Actio Nom "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppat" V TV Actio Gen "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppat" Ex/V TV Der/NomAct N Sg Nom "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppat" Ex/V TV Der/NomAct N Sg Gen "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppan" N Sem/Act Sg Nom "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppan" N Sem/Act Sg Gen Allegro "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppa" N Sem/Edu Sg Nom PxSg1 "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppa" N Sem/Edu Sg Gen PxSg1 "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppa" N Sem/Edu Sg Gen Err/Orth PxSg1 "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppa" N Sem/Edu Sg Acc PxSg1 "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppa" N Sem/Edu Sg Acc Err/Orth PxSg1 "" ; "giellat" V TV Ind Prs ConNeg Err/Orth "< giela>" SELECT:3681:r683 ; "oahppa" N Sem/Edu Ess "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppat" V TV PrfPrc "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppat" V TV Ind Prt ConNeg "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppat" V TV Actio Nom "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppat" V TV Actio Gen "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppat" Ex/V TV Der/NomAct N Sg Nom "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppat" Ex/V TV Der/NomAct N Sg Gen "" ; "giellat" V TV Ind Prs ConNeg "< giela>" SELECT:3681:r683 ; "oahppan" N Sem/Act Sg Nom ""

albbas commented 4 years ago

Comment 14119

Date: 2020-11-09 14:10:41 +0100 From: Linda Wiechetek <>

There are more examples of this:

"<oahpahus guovddáš fáddat>" "oahpahusguovddášfádda" N Err/Lex Sem/Semcon Err/Orth Sg Nom PxSg2 Err/Spa ceCmp SELECT:2380:guovddáš &msyn-compound &typo giellalt/bugzilla-dummy#5->5 ADD:3823:compound AD D:3823:compound ADD:3823:compound ADD:8653:Err/Orth-any ADD:8753:other-errors msyn-compound typo "oahpahusguovddášfádda" N Err/Lex Sem/Semcon Sg Nom PxSg2 SELECT:2 380:guovddáš &SUGGEST giellalt/bugzilla-dummy#5->5 ADD:3823:compound ADD:3823:compound ADD:3823:compound COPY:3841:compound oahpahusguovddášfádda+N+Err/Lex+Sg+Nom+PxSg2 ? "oahpahusguovddášfádda" N Err/Lex Sem/Semcon Sg Nom PxSg2 SELECT:2 380:guovddáš &SUGGEST giellalt/bugzilla-dummy#5->5 ADD:3823:compound COPY:3841:compound oahpahusguovddášfádda+N+Err/Lex+Sg+Nom+PxSg2 ? "oahpahusguovddášfádda" N Err/Lex Sem/Semcon Sg Nom PxSg2 SELECT:2 380:guovddáš &SUGGEST giellalt/bugzilla-dummy#5->5 ADD:3823:compound ADD:3823:compound COPY:3841:compound oahpahusguovddášfádda+N+Err/Lex+Sg+Nom+PxSg2 ? "oahpahusguovddášfádda" N Err/Lex Sem/Semcon Sg Nom PxSg2 Err/SpaceCmp <W: 0.0> SELECT:2380:guovddáš &msyn-compound &real-PlNomPxSg2-PlNom &typo giellalt/bugzilla-dummy#5->5 ADD:38 23:compound ADD:3823:compound ADD:6294:real-PlNomPxSg2-PlNom ADD:3823:compound ADD :8653:Err/Orth-any ADD:8753:other-errors msyn-compound real-PlNomPxSg2-PlNom typo "oahpahusguovddášfádda" N Pl Err/Lex Sem/Semcon Nom Err/SpaceCmp S ELECT:2380:guovddáš &SUGGEST giellalt/bugzilla-dummy#5->5 ADD:3823:compound ADD:3823:compound ADD:6294:re al-PlNomPxSg2-PlNom COPY:6327:real-PlNomPxSg2-PlNom oahpahusguovddášfádda+N+Pl+Err/Lex+Nom+Err/SpaceCmp ? "oahpahusguovddášfádda" N Pl Err/Lex Sem/Semcon Nom Err/SpaceCmp S ELECT:2380:guovddáš &SUGGEST giellalt/bugzilla-dummy#5->5 ADD:3823:compound ADD:3823:compound ADD:6294:re al-PlNomPxSg2-PlNom ADD:3823:compound COPY:6327:real-PlNomPxSg2-PlNom oahpahusguovddášfádda+N+Pl+Err/Lex+Nom+Err/SpaceCmp ? "oahpahusguovddášfádda" N Err/Lex Sem/Semcon Sg Nom PxSg2 SELECT:2 380:guovddáš &SUGGEST &real-PlNomPxSg2-PlNom giellalt/bugzilla-dummy#5->5 ADD:3823:compound ADD:3823:comp ound ADD:6294:real-PlNomPxSg2-PlNom ADD:3823:compound COPY:3841:compound oahpahusguovddášfádda+N+Err/Lex+Sg+Nom+PxSg2 ? "oahpahusguovddášfádda" N Pl Err/Lex Sem/Semcon Nom SELECT:2380:gu