Open pravdomil opened 3 years ago
This is also relevant to the fact that Char.isLower
only works for ASCII, a regex-based alternative could make use of those classes
This is also relevant for sanely matching/stripping/replacing Emoji in Elm, with matchers like \p{Emoji}
, \p{Emoji_Modifier}
, \p{Emoji_Component}
, \p{Emoji_Modifier_Base}
, \p{Emoji_Presentation}
.
Usefull for example:
"čočka".match(/\p{Letter}+/u)
https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_Expressions/Unicode_Property_Escapes