kmadathil / sanskrit_parser

Parsers for Sanskrit / संस्कृतम्
MIT License
68 stars 21 forks source link

Different Transliteration Schemes for Input #165

Closed gasyoun closed 3 years ago

gasyoun commented 3 years ago

Can the script tolerate not only sambhūyasamutthāna, but also saṁbhūyasamutthāna?

issue

kmadathil commented 3 years ago

We can handle most Brahmi-derived scripts and some transliterations such as SLP1, ITRANS, IAST, HK, and Velthuis. Which transliteration uses ṁ?

avinashvarna commented 3 years ago

This is valid IAST. We depend on the indic_transliteration package to handle the transliteration. May need to see if there is some error in the package, or our wrapper around it.

kmadathil commented 3 years ago

IAST uses ṃ, not ṁ

Screenshot from 2021-03-24 10-28-01

avinashvarna commented 3 years ago

Oops. Sorry. didn't notice the dot above vs below. It looks like it represents a different character in ISO - https://en.wikipedia.org/wiki/International_Alphabet_of_Sanskrit_Transliteration#Comparison_with_ISO_15919

gasyoun commented 3 years ago

It looks like it represents a different character in ISO

Yes, and it's quite popular. IAST is not as IAST everywhere.

Which transliteration uses ṁ?

Those that want to differ different anusvaras or anusvara and anunasika.

kmadathil commented 3 years ago

Can you request that indic_transliteration support this encoding? Our support will come automatically.

kmadathil commented 3 years ago

We have two options 1) Handle this under "normalize" if it is indeed a common use-case 2) Get an ISO15919 scheme implemented, where this is legal

gasyoun commented 3 years ago

We have two options

Anyway will do.

kmadathil commented 3 years ago

Since Vishvas isn't too keen to implement it, this will happen as and when we have some extra bandwidth. In the meantime, please use IAST.

kmadathil commented 3 years ago

A temporary solution has been implemented, which internally forces ISO15919 to IAST. We will implement a full ISO15919 later.

kmadathil commented 3 years ago

Added UI support for this. Please remember to uncheck Strict Mode if you want to use ISO15919 smb