-
User dolanor@hachyderm.io on Mastodon asked:
```
does it handle grapheme cluster complexity?
```
Some initial investigation reveals that some do, such as `Words()` since it is counting groupin…
-
To reproduce:
```
» "👩👩👦👦" |> Str.toScalars
[128105, 8205, 128105, 8205, 128102, 8205, 128102] : List U32
» "👩👩👦👦" |> Str.graphemes
["👩", "👩", "👦", "👦"] : List Str
```
The `toS…
-
### What problem are you trying to solve?
Selection and caret position are two building blocks for editing text in canvas content. Consider the sequence of dragging out a text selection with a mous…
-
BPO | [30717](https://bugs.python.org/issue30717)
--- | :---
Nosy | @malemburg, @loewis, @terryjreedy, @scoder, @vstinner, @benjaminp, @jwilk, @mcepl, @ezio-melotti, @stevendaprano, @bitdancer, @metha…
-
I thought `graphemes("hi❤️🔥")`
would return the list: `["h", "i", "❤️🔥"]`, a list of grapheme clusters that i could iterate with:
```
l.foreach fn(c)
println(c)
```
which would pri…
-
The Unicode concept of 'grapheme cluster' currently fails to represent the small number of conjuncts that are used in modern Tamil, ie. kṣa க்ஷ and the two alternative sequences for srī, ஶ்ரீ and ஸ்ர…
r12a updated
4 months ago
-
Currently, grapheme clusters (`Characters`) are the only way to manipulate natural languages correctly.
So, I propose syntax for grapheme clusters literals like `g"𠮷野"`.
It might include a proposal …
-
The length limit of VARCHAR in some RDBs is the number of UTF-32 code points.
maxLength counts an emoji and some kanji as two.
Password requirements by NIST:
https://pages.nist.gov/800-63-3/sp8…
-
### Proposal Details
I propose to add the functions Graphemes and GraphemesReversed to the packages ~~strings and bytes~~ unicode [Edited by adonovan, Oct 8].
#14820 proposes to add such functiona…
-
### Related problem
Related: #14174 - The `seq char` command doesn't work well for non-alpha characters.
Rather than fixing `seq char`, what if we deprecate it (or move a new version to the Stan…