n8willis / opentype-shaping-documents

Documentation of OpenType shaping behavior
171 stars 14 forks source link

Telugu + Kannada nakara-pollu #116

Open n8willis opened 3 years ago

n8willis commented 3 years ago

I was revisiting the Telugu chapter of Unicode and noticed the paragraph on nakāra-pollu, which I didn't remember from last time....

This isn't covered in this repo's docs, so I'm wondering if it should be. Specifically, the way Unicode Standard frames it, the sequence "Na,Halant" (U+0C28,U+0C4D) can have either of two forms (which is fine, and up to the font AFAICT), but it mentions specifically

"The character U+200C zero width non-joiner can be used to prevent interaction of this sequence with following consonants, as shown in Table 12-31."

and give two examples where the sequence is "Na,Halant,ZWNJ,Da" (U+0C28,U+0C4D,U+200C,U+0C26). Seems that this is a standard prevent-conjunct-formation usage, but if that's true, I'm curious why it is given a separate discussion (and table of illustrations). Is there something needing special treatment here?

n8willis commented 3 years ago

In Unicode 14, Telugu and Kannada have both gained an atomically-encoded nakaara-pollu codepoint. Changing the issue title to track....

n8willis commented 2 years ago

Further discussion here: https://unicode.org/L2/L2013/13228-kannada-nakaara-pollu.pdf

It looks to me like the ZWNJ usage here does not alter the logic unless the font does something out-of-the-ordinary with GSUB ligation to substitute in the nakara-pollu form. But it may be worth putting in a mention of it as we do for other alternate forms of consonants. Or maybe an illustration.