notofonts / noto-fonts

Noto fonts, except for CJK and emoji
http://fonts.google.com/noto
SIL Open Font License 1.1
2.45k stars 200 forks source link

Automatic Arabic ligatures #411

Closed shawnl closed 9 years ago

shawnl commented 9 years ago

https://en.wikipedia.org/wiki/Arabic_script_in_Unicode#Word_ligatures

The above ligature presentation forms were deprecated by the most recent UNICODE version, (and these ligatures are in common usage) however no existing fonts automatically combine these series of code-points into ligatures. (Actually Amiri appears to combine Allah/الله)

Either Noto should include these presentation form code-points (as these ligatures are in very common use), or the font should be sophisticated enough to render the ligatures automatically when the NFKD form for them is found.

khaledhosny commented 9 years ago

The Arabic Presentation Forms have been deprecated for a long time (probably since the day they were encoded as they are compatibility characters), I doubt that they are in common usage by any measure. Fonts like Amiri has many “ligatures(1)” because the style of the font requires them, but they are unrelated to those encoded characters, and simplified designs like Noto Arabic Naskh do not need them most of time (though I personally thing it can make use of some contextual alternates especially around final Yeh, but that is a different matter).

(1) Amiri does not actually include any ligatures, it has contextual alternates for specific combinations.

shawnl commented 9 years ago

While the code-points are not common, the ligatures are. Without ligatures(1) provided by the fonts, applications will continue to use images for this purpose, which is non-optimal.

(1)or contextual alternatives

khaledhosny commented 9 years ago

Do you have examples of such use on the wild?

shawnl commented 9 years ago

http://www.mujahideenryder.net/pdf/WhoAretheDisbelievers.pdf

https://www.reddit.com/r/islam/ (see right side-bar)

p.1 of the Arabic-English Al-Ghazali's The Incoherance of the Philosophers, translated by Michael Marmura, BYU (a wide ﷽)

Mostly religious stuff with ﷺ/صلى الله عليه وسلم

khaledhosny commented 9 years ago

Noto Arabic Naskh already has ﷺ, so what you is missing is only a wide ﷽, since other symbols in your examples don’t have Unicode points at all.

shawnl commented 9 years ago

OK. So then Unicode should make these characters subject to canonical decomposition, rather than compatibility decomposition. Would that not be a better method of dealing with the multiple forms, so that a typesetter can choose a decomposed الله in Amiri if they want? http://www.unicode.org/reports/tr15/ Where can I file that bug?

How did you do a footnote in markdown when you closed the bug?(1)

(1)Like this

brawer commented 9 years ago

Unicode should make these characters subject to canonical decomposition, rather than compatibility decomposition. [...] Where can I file that bug?

http://www.unicode.org/reporting.html

behdad commented 9 years ago

So then Unicode should make these characters subject to canonical decomposition, rather than compatibility decomposition.

No. You are wrong. Those characters don't need to be in Unicode. Ligatures are formed out of glyphs. Just listen to whatever Khaled says. He knows what he's saying.