How did your compose the Inflector::ACCENTED_CHARACTERS? Can we add non-latin symbols there?
I guess this can be done automatically using official unicode data: https://unicode.org/Public/UNIDATA/UnicodeData.txt
The 6th column shows what is the character composed of (or it's empty if the character is not accented).
Hello.
I'm trying to use
Inflector::unaccent
to compare strings the same way as mysql with utf8_unicode_ci(_ai that can be omitted) collation compare.I found out that some (or all) cyrillic letters are not unaccented. For example, russian
ё
(https://www.compart.com/en/unicode/U+0451) and ukrainianї
(https://www.compart.com/en/unicode/U+0457) which are widely used. The mysql compares them properly and removes accents.How did your compose the
Inflector::ACCENTED_CHARACTERS
? Can we add non-latin symbols there?I guess this can be done automatically using official unicode data: https://unicode.org/Public/UNIDATA/UnicodeData.txt The 6th column shows what is the character composed of (or it's empty if the character is not accented).