cometkim / unicode-segmenter

A lightweight and fast, pure JavaScript library for Unicode segmentation
MIT License
37 stars 0 forks source link

Support for Unicode Emoji Sets (UTS#51) #48

Open cometkim opened 2 days ago

cometkim commented 2 days ago

https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/RegExp/unicodeSets https://www.unicode.org/reports/tr51/#Emoji_Sets

Technically this isn't part of the Unicode Segmentation, but examining the sequence of a specific emoji segment is one of the essential combinations.

for example to resolve https://github.com/matrix-org/matrix-react-sdk/pull/12582

cometkim commented 2 days ago

New matcher APIs to unicode-segmenter/emoji

It will do #40 first as this needs to script change.