w3c / string-search

Parking lot for advice on internationalization related string searching in general content
https://w3c.github.io/string-search/
3 stars 10 forks source link

IVS in string searching #21

Open xfq opened 10 months ago

xfq commented 10 months ago

In an IVS, when VS and the previous code point are displayed and used, they should be treated as one "character" when displayed. When doing string searching operations, the first code point should be the processing baseline. For example, if two Han characters appear together, such as "龍VS天" (U+9F8D U+E0100 U+5929), searching "龍天" should match the relevant content. If necessary, the browser can also include some preference to config "precise" string searching operations.

Some IVS examples:

Currently, Blink and WebKit supports this, but Gecko does not.

Maybe we can add it to https://w3c.github.io/string-search/#orthoVariation ?