rosettatype / hyperglot

Hyperglot: a database and tools for detecting language support in fonts
http://hyperglot.rosettatype.com
GNU General Public License v3.0

Buginese Latin orthography uses Cyrillic Schwa/schwa. It should probably use the corresponding Latin codepoints. #77

MrBrezina closed this issue 2 years ago

MrBrezina commented 2 years ago

From @SorkinType:

For the language Bugis/Buginese it looks like you have Cyrillic capital letter Schwa U+04D8 and Cyrillic small letter schwa U+04D9 listed as letters used by them (when using Latin script). I was expecting Schwa U+018F and schwa U+0259.

The use of a Cyrillic letter seems odd since the rest of the letters are Latin ones. No?

I tried this evening to find some references online that might support or disprove this information but I am coming up with nothing. Do you have any light to shed on this mystery? Any references?
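For reference, the four code points named in the report can be inspected with Python's standard `unicodedata` module. This is only an illustrative sketch; the helper name `describe` is made up here and is not part of Hyperglot.

```python
import unicodedata

# The four code points named in the report above.
CODE_POINTS = [0x04D8, 0x04D9, 0x018F, 0x0259]


def describe(cp):
    """Return 'U+XXXX <Unicode name>' for a code point (hypothetical helper)."""
    return f"U+{cp:04X} {unicodedata.name(chr(cp))}"


for cp in CODE_POINTS:
    # The glyphs look alike, but the names show which script each belongs to.
    print(describe(cp))
```

The Unicode names make the script of each character explicit even though the shapes are visually indistinguishable in most fonts.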

MrBrezina commented 2 years ago

@kontur fixed imho

kontur commented 2 years ago

Looking at commit 62721ea I am not sure it is. Now bug (Buginese) seems to link to zsm, which doesn't have Schwa at all. The original issue seems valid.

kontur commented 2 years ago

Also, this is the first time I've learned of Schwa having both a Latin and a Cyrillic code point. We could write a test for that (and possibly for other similarly ambiguous glyph encodings).
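A minimal sketch of what such a test might look like, assuming an orthography's characters are available as a plain string. It uses the Unicode character name prefix as a rough stand-in for the script, which is a heuristic and not a real script-property lookup. The function name `foreign_script_chars` and the sample character lists are hypothetical and do not reflect Hyperglot's actual data or API.

```python
import unicodedata


def foreign_script_chars(chars, expected="LATIN"):
    """Return the letters in `chars` whose Unicode name does not start
    with `expected`. Whitespace, combining marks, and other non-letters
    are skipped. Hypothetical helper; name prefix is only a heuristic."""
    offenders = []
    for ch in chars:
        if not ch.isalpha():
            continue
        name = unicodedata.name(ch, "")
        if not name.startswith(expected):
            offenders.append(ch)
    return offenders


# Made-up character lists for illustration, not the real Hyperglot entries.
with_cyrillic_schwa = "A B E \u04D8 a b e \u04D9"
with_latin_schwa = "A B E \u018F a b e \u0259"

print(foreign_script_chars(with_cyrillic_schwa))
print(foreign_script_chars(with_latin_schwa))
```

A check along these lines would flag the Cyrillic Schwa in a Latin orthography while leaving the Latin Schwa alone. Orthographies that legitimately mix scripts would need an allow-list.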

MrBrezina commented 2 years ago

See line 1744 before and after

kontur commented 2 years ago

Roger that, I didn't dig that deep. Let's keep this issue open until the update is released on pip 👍