indic-transliteration / sanscript.js

Transliteration package for Indian scripts
MIT License
99 stars 39 forks source link

Wylie transliteration #56

Closed ayya-vimala closed 2 years ago

ayya-vimala commented 2 years ago

Thank you for a great library. Would it be possible to add Wylie to it? I think it should not be so hard to add.

vvasuki commented 2 years ago

https://en.wikipedia.org/wiki/Wylie_transliteration doesn't cover many sanskrit sounds, and other Indic scripts don't cover many Tibetan sounds. So, it isn't clear what "adding" would mean.

ayya-vimala commented 2 years ago

extended Wylie should be able to convert without loss since sanskrit transliterations occur in Tibetan (especially in names of people, texts, places etc.). Extended Wylie (EWTS) is today's de-facto standard for computer software working with Wylie. https://www.thlib.org/reference/transliteration/teachingewts.pdf

vvasuki commented 2 years ago

wylie.txt

I'm still unclear about the encoding of some sanskrit sounds - please edit the attached file and revert.

ayya-vimala commented 2 years ago

Thanks so much! I just want to double check with somebody at the university when he is back from holiday next week. I will get back to you on this.

ayya-vimala commented 2 years ago

I checked back with the person at the university and he checked your codebase as well. He said:

tibetan script is not just a 1:1 mapping, but the individual position of the characters is very improtant as well since there are many stacked letters, prescripts, postscripts, subscript, superscript

He checked out your code but felt that it is lacking the correct handling for the stacked letters to be able to add Wylie in this way with a 1-1 script. He referred me to https://github.com/buda-base/jsewts for a better transliteration system for this.

So unfortunately I think we might have to abandon this route and I offer my sincere apologies that I was unaware of the extra complexities Tibetan<->Wylie poses as opposed to other Asian scripts. It seemed a lot easier to me than it actually was.