giellalt / lang-smj

Finite state and Constraint Grammar based analysers and proofing tools + language resources for Lule Sámi
https://giellalt.uit.no
GNU General Public License v3.0

Downcasing of derived proper nouns broken by hyperminimisation #91

Open albbas opened 8 years ago

albbas commented 8 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 2145

Date: 2016-01-11T14:22:44+01:00 From: Sjur Nørstebø Moshagen <> To: Sjur Nørstebø Moshagen <> CC: elena.j.paulsen, sjur.n.moshagen, thomas.omma, trond.trosterud

Last updated: 2018-05-09T12:52:55+02:00

albbas commented 8 years ago

Comment 11052

Date: 2016-01-11 14:22:44 +0100 From: Sjur Nørstebø Moshagen <>

./configure --with-hfst --enable-hyperminimisation

gives the following:

[ 1/18][FAIL] Narvijkka+N+Prop+Sem/Plc+Sg+Gen+Der/k+N+Sg+Nom => Missing results: narvijkak

[ 1/18][FAIL] Narvijkka+N+Prop+Sem/Plc+Sg+Gen+Der/k+N+Sg+Nom => Unexpected results: Narvijkak

This is because hyperminimisation inserts an extra symbol at the very beginning of the net: @P.LEXNAME.Root@. This symbol breaks the context requirements of the downcasing regex.
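To make the failure mode concrete, here is a toy Python model of it. It is only a sketch under assumptions: the actual downcasing regex is not quoted in this report, so `downcase_naive` is a hypothetical stand-in for a rule that requires the capital letter to be the very first symbol of the path. `downcase_flag_aware` shows one conceivable way around the problem; it is not necessarily the fix that was applied.

```python
import re

# Flag diacritics such as @P.LEXNAME.Root@ are single multi-character symbols.
FLAG = re.compile(r"@[A-Z]\.[^@.]+(?:\.[^@.]+)?@")


def tokenize(s):
    """Split a string into symbols, keeping each flag diacritic as one symbol."""
    symbols = []
    i = 0
    while i < len(s):
        m = FLAG.match(s, i)
        if m:
            symbols.append(m.group())
            i = m.end()
        else:
            symbols.append(s[i])
            i += 1
    return symbols


def downcase_naive(symbols):
    """Hypothetical rule: lowercase a capital only if it is the first symbol."""
    if symbols and len(symbols[0]) == 1 and symbols[0].isupper():
        return [symbols[0].lower()] + symbols[1:]
    return symbols


def downcase_flag_aware(symbols):
    """Hypothetical variant: skip leading flag diacritics before downcasing."""
    out = list(symbols)
    for i, sym in enumerate(out):
        if FLAG.fullmatch(sym):
            continue  # flags are invisible on the surface, so look past them
        if len(sym) == 1 and sym.isupper():
            out[i] = sym.lower()
        break  # only the first real symbol is a candidate
    return out


def surface(symbols):
    """Drop flag diacritics to get the visible surface form."""
    return "".join(s for s in symbols if not FLAG.fullmatch(s))


plain = tokenize("Narvijkak")
hyper = tokenize("@P.LEXNAME.Root@Narvijkak")

print(surface(downcase_naive(plain)))       # path without the extra symbol
print(surface(downcase_naive(hyper)))       # path with the extra leading symbol
print(surface(downcase_flag_aware(hyper)))  # flag-aware variant
```

In this model the naive rule no longer fires once the leading `@P.LEXNAME.Root@` symbol is present, which mirrors the `Narvijkak` versus `narvijkak` mismatch in the test output above.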

Hyperminimisation is not used very much, but can be turned on for speller optimisations without people being aware of or remembering this bug. It should thus be fixed.

albbas commented 8 years ago

Comment 11053

Date: 2016-01-12 08:54:24 +0100 From: Thomas Omma <>

hyperminimisation! :O

albbas commented 6 years ago

Comment 12811

Date: 2018-05-09 12:52:55 +0200 From: Sjur Nørstebø Moshagen <>

This is now finally fixed, mostly. There are still a few regressions in the spellers for words with CmpN/*Left tags, so I will keep this issue open until it is completely fixed.