funderburkjim / boesp-prep-ab

Indische Sprüche preparation for Andhrabharati

any correction in encoding needed? #5

Closed Andhrabharati closed 3 years ago

Andhrabharati commented 3 years ago

; 373 : page 1.68

utf8: tAdRgyAdRghyuttaptavAlukAnikaraH

slp1: tAdfgyAdfGyuttaptavAlukAnikaraH

actual devanagari: तादृग्यादृग्‍ह्युत्तप्तवालुकानिकरः

funderburkjim commented 3 years ago

I don't see a problem with the conversion from HK (which is the encoding of the utf8 text) to slp1. The programmatic conversion to deva is तादृग्यादृघ्युत्तप्तवालुकानिकरः, and I don't see a problem there either.

I guess the problem you are mentioning is with the 'gh' in HK. HK has several problems, and this is one of them. In HK, two letters 'gh' are required to get the aspirated 'g'. In this case, the real desideratum is to represent two consonants 'g' followed by 'h'. There is no way to do this in (standard) HK.
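To make the ambiguity concrete, here is a minimal Python sketch of a longest-match HK to SLP1 transcoder. The table and the function name `hk_to_slp1` are assumptions made for illustration; this is not the conversion code used for this repository. Because the digraph 'gh' is always matched first, the intended 'g' followed by 'h' comes out as the aspirate 'G'.

```python
# Partial HK -> SLP1 table: only the letters that differ between the two schemes.
# Hypothetical illustration, not the project's transcoder.
HK_TO_SLP1 = {
    "lRR": "X", "RR": "F", "lR": "x", "ai": "E", "au": "O",
    "kh": "K", "gh": "G", "ch": "C", "jh": "J",
    "Th": "W", "Dh": "Q", "th": "T", "dh": "D", "ph": "P", "bh": "B",
    "R": "f", "G": "N", "J": "Y", "T": "w", "D": "q", "N": "R",
    "z": "S", "S": "z",
}


def hk_to_slp1(text):
    """Transcode HK to SLP1, always preferring the longest matching HK token."""
    out = []
    i = 0
    while i < len(text):
        for width in (3, 2, 1):
            chunk = text[i:i + width]
            if len(chunk) == width and chunk in HK_TO_SLP1:
                out.append(HK_TO_SLP1[chunk])
                i += width
                break
        else:
            # Letters shared by both schemes pass through unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


hk_word = "tAdRgyAdRghyuttaptavAlukAnikaraH"
# 'gh' is consumed as one token, so the result contains the aspirate 'G'.
print(hk_to_slp1(hk_word))
```

For the word in this issue the sketch reproduces the slp1 form quoted above, aspirate included, which is why the HK text alone cannot carry the distinction.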

So, what you should do is correct the Devanagari. Then I will convert this back to slp1, and there will be no loss there (in SLP1, 'gh' means 'g' followed by 'h', since the aspirate is written 'G').

But there would be a problem if we converted back to HK. I'll have to find some workaround from standard HK to deal with this when the time comes.
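The loss on the way back to HK can be sketched in a few lines of Python. The mapping table, the function name `slp1_to_hk`, and the SLP1 spelling of the corrected word (`cluster`) are assumptions introduced for illustration; the spelling is my reading of the corrected Devanagari, not something quoted in this thread.

```python
# Partial SLP1 -> HK table: only the letters that differ between the two schemes.
# Hypothetical illustration, not the project's transcoder.
SLP1_TO_HK = {
    "f": "R", "F": "RR", "x": "lR", "X": "lRR", "E": "ai", "O": "au",
    "K": "kh", "G": "gh", "N": "G", "C": "ch", "J": "jh", "Y": "J",
    "w": "T", "W": "Th", "q": "D", "Q": "Dh", "R": "N",
    "T": "th", "D": "dh", "P": "ph", "B": "bh", "S": "z", "z": "S",
}


def slp1_to_hk(text):
    """Transcode SLP1 to HK one character at a time (SLP1 has no digraphs)."""
    return "".join(SLP1_TO_HK.get(ch, ch) for ch in text)


aspirated = "tAdfgyAdfGyuttaptavAlukAnikaraH"   # slp1 form quoted in the issue
cluster = "tAdfgyAdfghyuttaptavAlukAnikaraH"    # assumed slp1 for 'g' + 'h'

# Two different SLP1 strings collapse to one HK string.
print(slp1_to_hk(aspirated) == slp1_to_hk(cluster))
```

Since both spellings map to the same HK string, any workaround has to add information that standard HK does not provide, for example a separator between 'g' and 'h'.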

Please indicate other such flaws as you notice. This will help me with the workaround.

Andhrabharati commented 3 years ago

Yes, I am talking about an encoding issue here, not a conversion issue.

I already corrected the devanagari word, as posted above.

Andhrabharati commented 3 years ago

This issue does not seem to be resolved yet.

funderburkjim commented 3 years ago

Just to emphasize, this is not an issue for slp1 and deva.

I will have to deal with it if and when we transcode back to the HK that Thomas is working with.

But it doesn't affect your work with Sanskrit verses, except that you will have to get the Devanagari right, as you are no doubt already doing.

Andhrabharati commented 3 years ago

got it.