TheHockeyist / russian-untransliterator

This is a program to convert Latin text, e.g. zdravstvujte, into its Cyrillic equivalent - здравствуйте.
MIT License
0 stars 1 forks source link

Capitalization ignored #17

Open Tymewalk opened 7 years ago

Tymewalk commented 7 years ago

YA will transliterate as я and not Я.

We also need to consider Ya and yA.

TheHockeyist commented 7 years ago

Well, how do we do that? I use lowercase by default.

Does something like Igor' go to игорь and not Игорь?

TheHockeyist commented 7 years ago

Same thing would happen to yu, Yu, yU, and YU, etc.

First, all the Latin goes to Cyrillic, and then the Cyrillic is corrected. So, really, it's not yu/ju that's being corrected - it's ыу/йу that's being corrected to ю. Repeat for the others.

Here's what I suggest for the ja case (repeat for the others with same capitalization rules).

йа - я.

Йа - Я.

ЙА - Я.

йА - ??? Weird!