Open fbanados opened 2 weeks ago
Here's where this is discussed for the FST side:
This issue also affects the merging of entries: e.g. AECD has a yôwênam
entry, while CW uses ýowênam
. If the merging process does not handle this, definitions are left out.
Agreement is that the strict analyzer FST should be slightly relaxed to accept ý -> y
. Eventually, also the generator FST should generate ý
whenever the data used to build the FST supports it (that is, not every y shall become ý). In this way, main entries with ý can preserve the letter in their identifier, and thus itwewina can later implement an option to either show or hide the ý in the presentation.
We can have an optional conversion of ý (->) y
on the analysis side of the strict (normative) analyzing FST.
I've recompiled the FSTs and these have been deployed to the dev version for testing.
ý
characters are used in CW to denote a sound that change between different Cree dialects. For itwewina, these must be changed toy
. We need a more consistent and future-proof way of handling these both in thecrk-db
repo and inmorphodict
.See also #44, #30, #68, UAlbertaALTLab/morphodict#929, UAlbertaALTLab/morphodict#649, UAlbertaALTLab/morphodict#197, UAlbertaALTLab/morphodict#255, https://github.com/UAlbertaALTLab/morphodict/issues/96#issuecomment-552063925 https://github.com/UAlbertaALTLab/itwewina/issues/104#issuecomment-449160563, https://github.com/UAlbertaALTLab/crk-db/blob/9278ba490909d07ed5e3b41405bf60dfc0126687/lib/convert/CW.js#L48,
ý
toy
for itwewina morphodictcrk-db
utilizes correct spellings for merging with AECD (otherwise entries are removed)