w3c / tlreq

Tibetan Layout Requirements document
https://www.w3.org/International/tlreq/

How are word boundaries detected in Tibetan? #15

Open r12a opened 6 months ago

r12a commented 6 months ago

Tibetan has no delimiter for word boundaries (only for tsheg-bar boundaries), but I'm led to believe that the preference is to wrap lines at word boundaries where possible.

Is this something that applications currently do? If so, how do they detect the word boundaries? Do they use dictionary lookups, as is done for Chinese, Thai, etc.?
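
To make the dictionary-lookup idea concrete, here is a minimal sketch of what such an approach might look like: split the text into tsheg-bars at U+0F0B, then greedily group them into the longest run found in a word list. The function names, the `max_len` limit, and the two-entry lexicon are all assumptions for illustration; this is not a description of what any existing application does.

```python
TSHEG = "\u0f0b"  # TIBETAN MARK INTERSYLLABIC TSHEG


def tsheg_bars(text):
    """Split text into tsheg-bars, dropping empty pieces."""
    return [bar for bar in text.split(TSHEG) if bar]


def segment(text, lexicon, max_len=4):
    """Greedy longest-match grouping of tsheg-bars into words.

    `lexicon` is a set of tuples of tsheg-bars (hypothetical word list).
    A tsheg-bar that starts no known word is emitted on its own.
    """
    bars = tsheg_bars(text)
    words = []
    i = 0
    while i < len(bars):
        # Try the longest candidate first; fall back to a single tsheg-bar.
        for n in range(min(max_len, len(bars) - i), 0, -1):
            candidate = tuple(bars[i:i + n])
            if n == 1 or candidate in lexicon:
                words.append(candidate)
                i += n
                break
    return words


# Toy lexicon (an assumption): two two-syllable words.
lexicon = {("བཀྲ", "ཤིས"), ("བདེ", "ལེགས")}
example = segment("བཀྲ་ཤིས་བདེ་ལེགས", lexicon)
```

Greedy longest match is only one possible strategy and will mis-segment ambiguous sequences; a real implementation would need a full lexicon and some way to handle particles and unknown words.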

nickscottprior commented 5 months ago

> i'm led to believe that the preference is to wrap lines at word boundaries where possible

The preference for Tibetan? I don't think this is common. Every genre and format of text that I've read from Old Tibetan to the modern day flagrantly breaks words across different lines all the time.
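
For comparison, the behaviour described here (breaking after any tsheg, regardless of word boundaries) needs no dictionary at all. A rough sketch, assuming line width is counted in code points as a crude stand-in for rendered width, and with `wrap_at_tsheg` being a hypothetical helper name:

```python
TSHEG = "\u0f0b"  # TIBETAN MARK INTERSYLLABIC TSHEG


def wrap_at_tsheg(text, width):
    """Greedy wrap that allows a break only after a tsheg."""
    # Cut the text into pieces, keeping each tsheg attached to the
    # tsheg-bar it follows.
    pieces = []
    start = 0
    for i, ch in enumerate(text):
        if ch == TSHEG:
            pieces.append(text[start:i + 1])
            start = i + 1
    if start < len(text):
        pieces.append(text[start:])

    # Fill each line with as many pieces as fit within `width`.
    lines = []
    current = ""
    for piece in pieces:
        if current and len(current) + len(piece) > width:
            lines.append(current)
            current = piece
        else:
            current += piece
    if current:
        lines.append(current)
    return lines
```

A piece longer than `width` is placed on a line of its own rather than being split, so a line can overflow in that case.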