Open gasyoun opened 1 year ago
Let's start initially with a file whose lines are like:
russian iast
Example
Make the russian as short as possible in this. E.g.
и i
Since Russian has capital letters, a long iast I could be represented as
И ī
Note it is ok to have more than one Russian letter to correspond 1 iast letter.
For this first file, let's stick to how to represent just 1 iast letter.
Then, we'll have to understand other subtle points that this simple 'russian iast' mapping does not represent.
a а ?backwards? а ā и i и ī у u у ū ри ṛ ри ṝ л ḷ л ḹ е е ай ai o o ау au х ḥ н ṃ к k кх kh г g гх gh н ṅ ч c чх ch дж j джх jh нь ñ т ṭ тх ṭh д ḍ дх ḍh н ṇ т t тх th д d дх dh н n п p пх ph б b бх bh м m й y р r л l в v ш ś ш ṣ с s х h
Notes:
Note cyrillic reference
Thanks so much, it works!
a а ?backwards?
What do you mean by backwards? Cyrillic looks similar to Latin, but they have different Unicode numbers.
Let's move one.
джньяна
will not be found (wanted jñāna), because now we have to treat clusters in addition to converting just simple letters.
So we need джня jña
above other rules.
This change slightly different from the previous comment.
<e> <s>INIT</s> <in>джня</in> <out>jY</out> <next>INIT</next></e>
<!-- ignore these cyrillic characters -->
<e> <s>INIT</s> <in>ь</in> <out></out> <next>INIT</next></e>
Thus джньяна does not work, but джнаяна does work, as does джнана.
Another source I used is https://www.lexilogos.com/keyboard/russian_conversion.htm.
If there are many other rules required, it might be efficient for you to experiment with a local installation which includes csl-apidev. You could adjust rules in cyrillic_slp1.xml until satisfied.
As mentioned I would want to be able to type Russian letters in https://sanskrit-lexicon.uni-koeln.de/simple/ so they would be converted to latin. We'v developed a VBEE converter we use for EmEditor. The order of the replacements matters. Let's take an example:
If we search for "джня" it should return "jña" which in return in
simple
might give as "jñā" as well.If we would start with:
We would miss this case:
Only at word endings:
Only at word beginnings:
What format should I convert it to @funderburkjim?
_IAST-Rus_Converter_1.2.txt