silnrsi / sldr

SIL Locale Data Repository
MIT License
19 stars 11 forks source link

Avoyaka [avu] exemplar characters issues #29

Closed moyogo closed 1 year ago

moyogo commented 1 year ago

sldr/a/avu_Latn_SD.xml and sldr/a/avu_Latn_SD.xml have {ı̣} in the main exemplar characters. It should be as listed in Eileen Kilpatrick, "Orthographies of Moru-Maʼdi languages", Occasional Papers in the Study of Sudanese Languages No. 9, p. 88 or Rhonda Hartell, Alphabets of Africa, 1993. The Avoyaka orthography seems to always indicate tone (with acute, circumflex, tilde) and and {ı̣} are visually identical with accents, which may be the cause of the issue.

emily-roth commented 1 year ago

Thank you so much for raising this issue!

Your guess as to the source of the issue was correct: In several fonts (sometimes depending on if they are using subsetting or not),{ị} will keep its upper dot even when diacritics are added, which obviously was not intended. In the Bible Translation published in 2002 (Tã-drı̣̃ Lẽlẽ Óꞌdí Óvârí Kâ), they got around the issue by using {ı̣} to make sure the upper dot didn't interfere with the diacritics. Clever solution, but still incorrect for the purposes of the SLDR, and most likely the source of the original incorrect data.

I updated the file in the SLDR accordingly and it should be reflected in the master branch of the repo at this point. It'll update everywhere else when we do our next release of the SLDR, which should hopefully be sometime this coming November/December. Thank you again!