Open r12a opened 4 years ago
The first comment in this issue contains text that will automatically appear in the Tamil gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.
Discussing with Richard, it seems that Prohibited Break in UAX #14
should be used for this kind of usage.
In general, there are no issues with ordinary Tamil line-breaking. Line-break opportunities occur at the spaces between words, and characters that are not supposed to appear at a line start do not.
The exception to the latter is that । [U+0964 DEVANAGARI DANDA] and ॥ [U+0965 DEVANAGARI DOUBLE DANDA] will wrap alone to the beginning of a line if there is a space between them and the previous word. This is the case for all browsers.
Test: A line should not start with a danda character even if it is separated from the previous word by a space.
The impact of this is advanced, although it would be good to fix it.