-
"Grapheme cluster" is often the appropriate way to define "character" in a specifications (such as CSS) which care about things readers visually identify as a character.
Maybe the spec should point t…
-
### Discord username (optional)
paperdave.net
### Describe the bug
warp's input buffer is unusable when using certain multi-codepoint emojis. i dont seem to reproduce this for all grapheme cluster …
-
The glossary entry for "orthographic syllable" describes it as a "typographic character unit", which in turn is described as a unit "that is indivisible with respect to a particular typographic operat…
-
The Unicode concept of 'grapheme cluster' currently fails to represent the small number of conjuncts that are used in modern Tamil, ie. kṣa க்ஷ and the two alternative sequences for srī, ஶ்ரீ and ஸ்ர…
-
### Discord username (optional)
paperdave.net
### Describe the solution you'd like?
Certain graphemes such as क्षि, 👩🌾, 🏳️⚧️, 각, and many more are made up of multiple unicode code points. These c…
-
Hey, Are there any plans to implement more advanced text processing facilitates like the ones mentioned above?
pr8x updated
12 months ago
-
Not sure if this is in the scope of the project, but it would be useful to have an accurate way to count grapheme clusters, and I don't know another way to do this in Go.
http://stackoverflow.com/que…
-
The Unicode concept of 'grapheme cluster' currently fails to represent syllabic conjuncts (plus vowels, etc) in scripts like Devanagari. This means that various editing operations, line breaking algor…
-
Are you still interested in implementing grapheme cluster boundaries from UAX #29?
I was looking into porting this: https://github.com/orling/grapheme-splitter/blob/master/index.js
I found your…
-
This issue is applicable to most languages that form conjuncts from consonant clusters using an invisible virama.
A consonant cluster that uses a conjunct (rather than visible virama) should not be…