w3c / iip

Documenting gaps and requirements for support of Indic languages on the Web and in eBooks.
https://w3c.github.io/iip/

Devanagari: 4.2 Hyphenation - Add rules for hyphenation from existing documentation #41

Open alolita opened 5 years ago

alolita commented 5 years ago

https://w3c.github.io/iip/gap-analysis/deva-gap.html#hyphenation

Hyphenation is used in Devanagari today.

Hyphenation rules need to be added based on C-DAC documentation. @akshatsj, @nehagk - please add to this issue based on existing documented rules.

nehagk commented 5 years ago

We could not find any authoritative document on hyphenation. However, one clear requirement for within-word hyphenation is that a break must fall on an akshara boundary and must never split an akshara.
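
To make the akshara-boundary requirement concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not a C-DAC rule set: the function names `is_base`, `split_aksharas` and `hyphenation_points` are hypothetical, the input is assumed to be a single Devanagari word, an akshara is approximated as a base letter plus everything attached to it, and a consonant following a virama (optionally with ZWJ/ZWNJ) is assumed to stay in the same akshara.

```python
VIRAMA = "\u094D"
ZWNJ = "\u200C"
ZWJ = "\u200D"


def is_base(ch):
    """True for Devanagari independent vowels and consonants (simplified ranges)."""
    cp = ord(ch)
    return (0x0904 <= cp <= 0x0939      # independent vowels and consonants
            or 0x0958 <= cp <= 0x0961   # nukta consonants, vocalic RR / LL
            or 0x0972 <= cp <= 0x097F)  # additional vowels and consonants


def split_aksharas(word):
    """Split a word into approximate aksharas (assumption: see lead-in)."""
    aksharas = []
    for ch in word:
        if not aksharas:
            aksharas.append(ch)
            continue
        # A virama (optionally followed by ZWJ/ZWNJ) keeps the next letter
        # inside the current cluster. This is a simplifying assumption.
        joined = aksharas[-1].rstrip(ZWJ + ZWNJ).endswith(VIRAMA)
        if is_base(ch) and not joined:
            aksharas.append(ch)      # a new akshara starts here
        else:
            aksharas[-1] += ch       # matra, sign, virama, or conjunct member
    return aksharas


def hyphenation_points(word, min_before=1, min_after=1):
    """Return string offsets that fall on akshara boundaries only."""
    aksharas = split_aksharas(word)
    points, offset = [], 0
    for i, akshara in enumerate(aksharas[:-1], start=1):
        offset += len(akshara)
        if i >= min_before and len(aksharas) - i >= min_after:
            points.append(offset)
    return points


# "vidyalaya": VA, I-matra, DA, virama, YA, AA-matra, LA, YA
word = "\u0935\u093F\u0926\u094D\u092F\u093E\u0932\u092F"
print(split_aksharas(word))      # four aksharas: vi | dya | la | ya
print(hyphenation_points(word))  # [2, 6, 7]
```

The `min_before` and `min_after` parameters are only placeholders for whatever minimum-length rules a real implementation would apply; no such values are documented in this thread.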

vivekpani commented 5 years ago

I think it would be good to try to get the details from C-DAC. Leap and ISM implemented hyphenation very well for all supported languages. The akshara definition on its own may not be the best criterion, because Unicode introduces ZWNJ and ZWJ, which may create confusion about where the boundaries lie for visual breaks versus logical breaks for hyphenation.
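
The ZWNJ ambiguity can be illustrated with a small Python sketch. It is only a demonstration of the open question, not a recommendation: the helper names `is_base` and `break_offsets` and the `zwnj_ends_cluster` switch are hypothetical, and the code simply assumes that a break may occur before any base letter that is not joined to the preceding one.

```python
VIRAMA, ZWNJ, ZWJ = "\u094D", "\u200C", "\u200D"


def is_base(ch):
    """True for Devanagari independent vowels and consonants (simplified ranges)."""
    cp = ord(ch)
    return (0x0904 <= cp <= 0x0939
            or 0x0958 <= cp <= 0x0961
            or 0x0972 <= cp <= 0x097F)


def break_offsets(word, zwnj_ends_cluster):
    """Offsets where a hyphen could go, under one of two ZWNJ policies.

    zwnj_ends_cluster=False: virama + ZWNJ still joins the letters logically.
    zwnj_ends_cluster=True:  the visible halant is treated as a boundary.
    """
    offsets = []
    for i in range(1, len(word)):
        if not is_base(word[i]):
            continue
        prev = word[i - 1]
        if prev in (VIRAMA, ZWJ):
            continue  # conjunct or half form: never break inside it
        if prev == ZWNJ and not zwnj_ends_cluster:
            continue  # logical view: still one cluster
        offsets.append(i)
    return offsets


# RA, KA, virama, ZWNJ, SSA, AA-matra: KA is shown with an explicit halant
word = "\u0930\u0915\u094D\u200C\u0937\u093E"
print(break_offsets(word, zwnj_ends_cluster=False))  # [1]
print(break_offsets(word, zwnj_ends_cluster=True))   # [1, 4]
```

The two policies disagree only at the ZWNJ, which is the visual-versus-logical question raised above; which one is correct for Devanagari hyphenation would need to come from authoritative documentation.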