OpenPecha-Data / catalog


Possible Wylie Input Error #7

Open 10zinten opened 2 years ago

10zinten commented 2 years ago

https://github.com/OpenPecha/P6EB98D99/blob/c26827400c1f254313313da7a44dc7d15fb4b454/P6EB98D99.opf/base/CIII_10%20Immeasurables.txt#L1

>>> converter.toUnicode('rigs kyi bu tshad med pa bcu po ’di dag ni gaṅ byaṅ chub sems dpas bsod nams kyi tshogs kyis')
'རིགས་ཀྱི་བུ་ཚད་མེད་པ་བཅུ་པོ་འདི་དག་ནི་གṅ་བྱṅ་ཆུབ་སེམས་དཔས་བསོད་ནམས་ཀྱི་ཚོགས་ཀྱིས'

https://github.com/OpenPecha/P6EB98D99/blob/c26827400c1f254313313da7a44dc7d15fb4b454/P6EB98D99.opf/base/CIII_10%20Immeasurables.txt#L3

>>> converter.toUnicode('po yoṅs su rdzogs pas lus kyi rgyan tshad med pa yoṅs su rdzogs par bya ba daṅ |sems can')
'པོ་ཡོṅས་སུ་རྫོགས་པས་ལུས་ཀྱི་རྒྱན་ཚད་མེད་པ་ཡོṅས་སུ་རྫོགས་པར་བྱ་བ་དṅ་༑སེམས་ཅན'


ngawangtrinley commented 2 years ago

This is unusual Wylie input. @eroux, how do you deal with this? Is it just a corner case that needs manual correction, or is it something we should improve in the Wylie converter?

eroux commented 2 years ago

It's not Wylie, but there are mappings for it in https://github.com/buda-base/ewts-converter/blob/master/src/main/java/io/bdrc/ewtsconverter/TransConverter.java
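
As a rough illustration of the kind of mapping involved (this is not the code in TransConverter.java), here is a minimal Python sketch. It assumes that only the two non-EWTS characters visible in the samples above, ṅ and ’, need replacing; `NON_EWTS_TO_EWTS` and `normalize_to_ewts` are hypothetical names.

```python
import unicodedata

# Hypothetical table: covers only the two non-EWTS characters that
# appear in the samples quoted in this issue. A real corpus will
# probably need more entries.
NON_EWTS_TO_EWTS = {
    "\u1e45": "ng",  # ṅ (n with dot above) -> EWTS "ng"
    "\u2019": "'",   # ’ (right single quotation mark) -> EWTS a-chung "'"
}


def normalize_to_ewts(text: str) -> str:
    """Replace the known non-EWTS characters with their EWTS spelling."""
    # Compose "n" + combining dot above into the single code point U+1E45
    # so that one table entry matches both encodings of ṅ.
    text = unicodedata.normalize("NFC", text)
    for source, target in NON_EWTS_TO_EWTS.items():
        text = text.replace(source, target)
    return text


if __name__ == "__main__":
    print(normalize_to_ewts("bcu po \u2019di dag ni ga\u1e45 bya\u1e45 chub"))
```

The normalized string could then be passed to `converter.toUnicode` as in the examples above. Whether this two-entry table is enough for the whole collection is an open assumption.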

10zinten commented 2 years ago

@jungtop, please convert all Oslo pecha to Tibetan.