w3c / iip

Documenting gaps and requirements for support of Indic languages on the Web and in eBooks.
https://w3c.github.io/iip/

Dandas are wrapped alone to the beginning of a line #105

Closed r12a closed 3 years ago

r12a commented 4 years ago

Gujarati uses the full stop to mark the end of a sentence. However, if the user wants to use the danda or double danda, then per the Unicode recommendation these must come from the Devanagari block of Unicode. The Devanagari phrase separators । U+0964 DEVANAGARI DANDA and ॥ U+0965 DEVANAGARI DOUBLE DANDA are encoded in the Devanagari block with the intent that they be used as common punctuation for all the major scripts of India, including Gujarati.
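As a quick check of the encoding described above, the following minimal Python sketch looks up the two dandas with the standard `unicodedata` module and scans the Gujarati block (assumed here to be U+0A80 to U+0AFF) for any character whose name contains "DANDA". The variable names are illustrative only.

```python
import unicodedata

# The two shared dandas, encoded in the Devanagari block.
DANDA = "\u0964"
DOUBLE_DANDA = "\u0965"

danda_names = [unicodedata.name(ch) for ch in (DANDA, DOUBLE_DANDA)]

# Scan the Gujarati block (assumed range U+0A80..U+0AFF) for a
# script-specific danda; unassigned code points yield an empty name.
gujarati_dandas = [
    f"U+{cp:04X}"
    for cp in range(0x0A80, 0x0B00)
    if "DANDA" in unicodedata.name(chr(cp), "")
]

print(danda_names)
print(gujarati_dandas)
```

If the scan comes back empty, Gujarati text that needs a danda has to borrow U+0964 or U+0965.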

The properties of purna viram and deergh viram should be the same as those of the full stop and other punctuation marks: a new line should not begin with a purna viram or a deergh viram.

r12a commented 4 years ago

The first comment in this issue contains text that will automatically appear in the Gujarati gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

fantasai commented 4 years ago

Seems like a UAX14 issue. Has it been reported to Unicode?

lianghai commented 4 years ago

Dandas are already classified as Break After (BA) in UAX #14 (see the section “Dandas”), the class for general closing characters.

A long-standing issue, though, is whether dandas should be further specified to behave like Exclamation/Interrogation (EX), so that a preceding space doesn’t cause a line break either (it’s a common style to surround dandas with a pair of spaces, like the French way of typesetting question and exclamation marks). This issue ticket may actually be referring to that situation.
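The difference between the two classes can be sketched with a toy model. The Python below is not a UAX #14 implementation: it models only three rules, in the order the algorithm applies them as I understand it (LB13 "no break before EX, even after spaces", LB18 "break after spaces", LB21 "no break before BA"). The function name and parameters are hypothetical.

```python
def break_allowed_before_danda(prev_is_space: bool, danda_class: str) -> bool:
    """Toy model: may a line break occur immediately before the danda?

    danda_class is the line-break class assumed for the danda: "BA" or "EX".
    Only three UAX #14 rules are modelled, checked in rule order.
    """
    if danda_class == "EX":
        return False  # LB13: never break before EX, even after a space
    if prev_is_space:
        return True   # LB18: a break opportunity follows a space
    if danda_class == "BA":
        return False  # LB21: no break directly before BA
    raise ValueError(f"unsupported class: {danda_class}")


# Danda directly after a word vs. after a space, under each class.
results = {
    ("word।", "BA"): break_allowed_before_danda(False, "BA"),
    ("word ।", "BA"): break_allowed_before_danda(True, "BA"),
    ("word ।", "EX"): break_allowed_before_danda(True, "EX"),
}
print(results)
```

Under these assumptions, a BA danda stays with the preceding word only when no space intervenes, whereas an EX-like danda would stay attached even in the spaced style.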

r12a commented 3 years ago

Closed in favour of https://github.com/w3c/iip/issues/88