Closed fxerhard closed 9 months ago
I would define exceptions in the following way:
Yuki, remember we will most likely have no TABs in the texts!Xaver ErhardSent from my phone-----------------------------------Dr. Franz Xaver ErhardUniversität LeipzigInstitut für Indologie und ZentralasienwissenschaftenProjekt: Divergent Discourses (DFG/AHRC)Internes PF: 13250104081 LeipzigGermany+49 341 97 37147+49 179 7010969http://uni-leipzig.academia.edu/XaverErhardOn 25. Jan 2024, at 13:33, ykyogoku @.***> wrote: I would define exceptions in the following way:
If ཌ་ or ཊ་ appears immediately after tsheg (་) or a tab (\t), which separates sentences, it is not replaced.
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: @.***>
I could not find ཊ་ཀ་ཤོ་བྷོ་ and པཊ་ཊི in the following online dictionaries:
I put the following lines in table1:
and in table3:
With the current implement, པཊ་ཊི would be replaced by པགས་ཊི་. I will tackle this problem again after the tab-missing issue is fixed.
For these an similar terms pls see at https://dictionary.christian-steinert.de/#%7B%22activeTerm%22%3A%22Ta%20ka%20sho%20bho%22%2C%22lang%22%3A%22tib%22%2C%22inputLang%22%3A%22tib%22%2C%22currentListTerm%22%3A%22paT%22%2C%22forceLeftSideVisible%22%3Afalse%2C%22offset%22%3A0%7D AndTibetan-English Dictionarydictionary.christian-steinert.deXaver ErhardSent from my phone-----------------------------------Dr. Franz Xaver ErhardUniversität LeipzigInstitut für Indologie und ZentralasienwissenschaftenProjekt: Divergent Discourses (DFG/AHRC)Internes PF: 13250104081 LeipzigGermany+49 341 97 37147+49 179 7010969http://uni-leipzig.academia.edu/XaverErhardOn 28. Jan 2024, at 18:18, ykyogoku @.***> wrote:ཊ་ཀ་ཤོ་བྷོ་
The optimal approach would be to take up all exceptions, such as ཌ་མ་རུ་, for which the replacement should not occur, if there are few.
I agree, as I see it, there are only very few exceptions with -ཌ or -ཊ with preceding letters, that is as final consonants. ཌ་མ་ར་ is in the beginning of a syllable and པཎྜིཏ is a stack (or perhaps written as པཎ་ཊི་ཏ?)
ཌ as subscript, as in པཎྜིཏ, is treated differently from the normal form or superscript of ཌ. And it is placed in the middle of the word/syllable, so the replacement does not take place in པཎྜིཏ.
Are there more exceptions other than the following?
(?:ཊ་ཀ་ཤོ་བྷོ་|ཊ་རན་རྩ་ནད་|ཊ་སྡེ་|པཊ་ཊི་|ཌ་མ་རུ་)
If there are no more exceptions, I close this issue.
I think this is fine, but I am not sure if we caught all exceptions. Maybe leave this open for the time being until we are sure.
Ok, I leave it open.
Tables look like that --- Table1:
I don't have to consider exceptional cases where they are combined with shad, since there is no exceptional word ending with them.
Table3:
final -ཌ་ and -ཊ་ should be replaced by -གས་
exceptions are few but possible, e.g. ཊ་ཀ་ཤོ་བྷོ་, པཊ་ཊི་, or ཌ་མ་རུ་