giellalt / lang-mdf

Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Moksha language
https://giellalt.uit.no
GNU Lesser General Public License v3.0
3 stars 0 forks source link

A symbol missing in Multichar_Symbols ( #4

Closed albbas closed 10 years ago

albbas commented 10 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 1808

Date: 2014-01-30T07:48:01+01:00 From: Trond Trosterud <> To: Jack Rueter <>

Last updated: 2014-01-30T12:15:17+01:00

albbas commented 10 years ago

Comment 9040

Date: 2014-01-30 07:48:01 +0100 From: Trond Trosterud <>

Here is a list of missing symbols:

*** Warning: Lost all entries containing the following symbols: 1 E T g m s +AdA +Voc +Superl

The combination 1 E T g m s indicates that some symbols +Et1sg or similarily has been used but not declared.

albbas commented 10 years ago

Comment 9042

Date: 2014-01-30 12:15:17 +0100 From: Jack Rueter <>

This bug has been informative!

++ *** Warning: Lost all entries containing the following symbols: 1 E T g m s +AdA +Voc +Superl ++

What it must mean is that somewhere in the lexc there are the tags: +AdA, +Voc and +Superl with no association to the beginning or ending of an analysis. Since LEXICA used but not defined is a warning/error found elsewhere, we can assume that the debugging necessary involves finding the tags and either add the LEXICON to some starting point, or remove it from the transducer.

Similarly the numerals and characters: "1 E T g m s" might appear when a tag has not been declared as a multichar-symbol in the root.lexc.

The Moksha bug has been resolved in main/langs/mdf. Now similar cleansing can be done in the other languages.