-
Some older platforms (older Android and older Ubuntu Touch, possibly also KaiOS) have problems rendering Unicode emoji, even after the emoji fonts are updated.
To solve that we should add a mode (n…
-
#4311 added the `text-transform` property. Relevant spec: http://dev.w3.org/csswg/css-text/#propdef-text-transform
Follow-ups include:
- Case mapping should include "full" and "special" mappings that c…
-
Does this issue occur when all extensions are disabled?: Yes
- VS Code Version: 1.78.1
- OS Version: macOS 13.3.1
Steps to Reproduce:
1. Put consecutive flags e.g. 🇩🇲🇩🇴 in…
-
### Background and motivation
There's an existing API, `StringInfo.GetTextElementEnumerator`, which allows us to enumerate textual elements (grapheme clusters, or in other words, the individual cha…
-
What is the string length of UTF-8 strings? Is it the number of characters or the number of UTF-8 code points?
Example:
```mo
model M
Integer len = Modelica.Utilities.Strings.length("日本語!")
…
```
-
From https://tex.stackexchange.com/questions/521431/fontspec-shifts-some-diacritics-marks-when-letterspacing
Letterspacing doesn't work well with a character followed by combining accents.
~~~~
\documentclass{a…
~~~~
-
The character count currently uses `string.length` to establish the length of the user input. `string.length` counts code units, not characters, and this can lead to some confusing results when using …
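
To illustrate the gap between code units and code points, here is a minimal sketch in Python. Python is used only for illustration and the function names are hypothetical. `utf16_code_units` mimics what JavaScript's `string.length` reports by counting UTF-16 code units.

```python
def utf16_code_units(text: str) -> int:
    """Count UTF-16 code units, the quantity JavaScript's string.length reports."""
    # Each UTF-16 code unit is two bytes; "utf-16-le" adds no byte-order mark.
    return len(text.encode("utf-16-le")) // 2


def code_points(text: str) -> int:
    """Count Unicode code points (what Python's len() reports for str)."""
    return len(text)


print(utf16_code_units("abc"), code_points("abc"))  # 3 3
print(utf16_code_units("😀"), code_points("😀"))    # 2 1
```

Neither count is a user-perceived character count: a letter followed by a combining accent is still two code points.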
-
Currently, string reversal works on code points and doesn't care what kind they are, so it won't properly reverse strings containing combining characters.
We could quite simply implement the [Missy Elliot alg…
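
One possible approach, sketched in Python under simplifying assumptions, is to keep each base character together with the combining marks that follow it and reverse those groups. This is not necessarily the algorithm linked above, and it is not full grapheme-cluster segmentation (it ignores emoji sequences, Hangul jamo, and so on). The function name is hypothetical.

```python
import unicodedata


def reverse_keeping_marks(text: str) -> str:
    """Reverse text while keeping combining marks attached to their base character."""
    clusters = []
    for ch in text:
        # unicodedata.combining() returns a non-zero class for combining marks.
        if clusters and unicodedata.combining(ch):
            clusters[-1] += ch  # attach the mark to the preceding base character
        else:
            clusters.append(ch)  # start a new group
    return "".join(reversed(clusters))


word = "ae\u0301b"  # "a", "e" + COMBINING ACUTE ACCENT, "b"
print(reverse_keeping_marks(word) == "be\u0301a")  # True: accent stays on the "e"
print(word[::-1] == "b\u0301ea")                   # True: naive reversal moves it to "b"
```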
-
Currently, the tokenization process treats a single `u8` as a character. This is fine if everything in the input is ASCII, but ZOMB files are UTF-8 encoded, so we need to update this to proce…
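
As a rough illustration of stepping through UTF-8 input one code point at a time rather than one byte at a time, here is a minimal Python sketch. It is not the project's tokenizer, and the function name is hypothetical. The lead byte of each sequence determines how many bytes the code point occupies.

```python
from typing import Iterator


def iter_utf8_chars(data: bytes) -> Iterator[str]:
    """Yield one decoded code point at a time from UTF-8 encoded bytes."""
    i = 0
    while i < len(data):
        lead = data[i]
        if lead < 0x80:              # 0xxxxxxx: ASCII, one byte
            width = 1
        elif lead >> 5 == 0b110:     # 110xxxxx: two-byte sequence
            width = 2
        elif lead >> 4 == 0b1110:    # 1110xxxx: three-byte sequence
            width = 3
        elif lead >> 3 == 0b11110:   # 11110xxx: four-byte sequence
            width = 4
        else:
            raise ValueError(f"invalid UTF-8 lead byte at offset {i}")
        # decode() validates the continuation bytes and rejects truncated input.
        yield data[i:i + width].decode("utf-8")
        i += width


encoded = "aé日😀".encode("utf-8")
print(len(encoded))                    # 10 bytes
print(list(iter_utf8_chars(encoded)))  # ['a', 'é', '日', '😀']
```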
-
Currently `Splitter.fixedLength` splits strings based on `char`s. It would be nice if the splitter had a configurable encoding, such as UTF-8, so that it could split on code points.
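
To show what splitting on code points would mean, here is a minimal sketch in Python, where `str` is indexed by code point. It assumes that `char` above refers to a UTF-16 code unit, as in Java. The function name is hypothetical and is not part of any existing splitter API.

```python
def split_fixed_length(text: str, length: int) -> list[str]:
    """Split text into chunks of at most `length` code points."""
    if length <= 0:
        raise ValueError("length must be positive")
    # Python slices str by code point, so a character outside the BMP is never cut in half.
    return [text[i:i + length] for i in range(0, len(text), length)]


print(split_fixed_length("a😀b😀c", 2))  # ['a😀', 'b😀', 'c']
```

A splitter that counts UTF-16 code units instead would treat each "😀" as two units and could place a chunk boundary between its two surrogates.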