Open funderburkjim opened 9 years ago
And it's totally left out in SLP1 as now, or what? In the current Cologne implementation.
dictionaries and words with Z,V
Dictionary | word | line in X.txt |
---|---|---|
AP | aGoza | 1762 |
AP | ayoga | 19395 |
AP | arDa | 20956 |
AP90 | aGoza | 3633 |
AP90 | ayoga | 27815 |
MW72 | arDa | 19829 |
MW72 | candra | 72006 |
SKD | SikzA | 411103,411246 |
An alternate IAST Unicode coding of Z and V (not currently used in Cologne displays)
slp1 | code point | character |
---|---|---|
Z | h U+0331 | h with COMBINING MACRON BELOW |
V | h U+032C | h with COMBINING CARON BELOW |
re 'And it's totally left out in SLP1 as now, or what? In the current Cologne implementation.'
To remedy 3 requires use of an adequate web font (Siddhanta) for Devanagari. For a display to use a web font, the CSS file used by the display requires certain coding. And, the Javascript of the display require changes. Experience suggests that these changes sometimes have side effects that require other changes.
So, currently, I am choosing to defer making the Devanagari look right for the ardhavisarga character. Since the characters Z,V occur so rarely (only in the words listed above), this seems acceptable.
Sure, 8 words is not a big list, still we must document this issue. If documented, not that bad at all.
I also saw some occurrences in vcp.txt. Is transcoder handling the jihvAmUlIya and upadhmAnIya properly @funderburkjim ?
Any transliteration schemes leading to lossy conversion?
For Devanagari, transcoder handling properly, I think.
Doubt it for other transliterations, such as IAST (slp1_roman.xml), etc.
In a few dictionaries, Sampada has found that missing data occurred where two visarga variants, jihvāmūlīya and upadhmānīya are being referenced. See, for example under SikzA in SKD.
This note is just to mention this oddity, and to indicate how these characters are coded, and how they are displayed. Also, to note some TODO items.
A reference for this is LIES (Linguistic Issues in Encoding Sanskrit) by Peter Scharf and Malcolm Hyman, downloadable from the publications section of Peter's web site at http://sanskritlibrary.org/tomcat/sl/-/pub/.
Another reference is http://en.wikipedia.org/wiki/Vedic_Sanskrit. Also, see http://unicode.org/charts/PDF/U1CD0.pdf.
Here's the coding:
The Devanagari is not shown, since only some fonts (siddhanta and prajna) will display any of these. The reason for the 2nd Devanagari column is that this 1CF2 unicode character displays well with siddhanta, and, according to Peter, is often used for the display of both variants. Here is an image of how it looks, which Peter describes as a pair of candras, one smiling above one frowning.
TODO: