drdhaval2785 / SanskritSorting

Codes written by Dr. Dhaval Patel for Sanskrit Natural Language Programming
2 stars 1 forks source link

Reverse 21 Issues (a/ -> á) #24

Closed gasyoun closed 9 years ago

gasyoun commented 9 years ago

http://www.sanskrit-lexicon.uni-koeln.de/scans/PWGScan/2013/web/webtc1/help/accents.html From https://github.com/drdhaval2785/SanskritSorting/issues/16 1) Let's have acute accents (Hex 0301) #kará--mardaká# instead of #kara/--mardaka/# (let us use combining accents https://en.wikipedia.org/wiki/Combining_character and http://www.alanwood.net/unicode/combining_diacritical_marks.html - so the search function will work in .pdf. For the printed book I will go http://ss64.com/unicode-accents.html way to have crisp typography)

And there should be no

| á |
#kará--mardaká#

as well.

Inside mwxml I found https://github.com/drdhaval2785/SanskritSorting/blob/master/accent_old2new.php. At least the '|<(key2)>(.*?)</key2>|' and preg_replace('|([/\\^])([aAiIuUfFeEoO])|',"$2$1" might be of some interest.

2) grave accent U+0300 13 cases of \

yajYadravya
yajYadruha\
yajYaDara
yajYASana
yajYAsA/ha\
yajYika

3) U+0302, svarita

anvAsana
anvAhArya^
apasy
apasya^
apasya
vye^nI
nIvyA^

It will look similar to

híṃsaka
híṃsa
daúḥṣvapnya
daúḥṣanti
bahíḥśri
alíṃśa
ahighnī̀
abhī̀ti

At the end.

drdhaval2785 commented 9 years ago

@gasyoun Question 1: They don't join. It gives me #kara´mardaka´# acute accent after the vowel, not above it. Only if I replace a/ -> á, I get #karámardaká#. But in that case also we are in deep trouble for ā. There is no member in character map for acute accent of ā. I guess I am missing something.

Question 2: Where to put accent in IAST in case of 'ai' and 'au' (written E and O in SLP1) on a / i ? a / u ?

drdhaval2785 commented 9 years ago

Done. see http://drdhaval2785.github.io/bookmarkstat_accent.html

Now both the question don't survive.

Currently they are not sorting properly. Accents are shown, but sorting is altered. I will have to work on it.

drdhaval2785 commented 9 years ago

Accent sorting issue is corrected in reverse22.php. Output is attached here.