sanskrit-lexicon / COLOGNE

Development of http://www.sanskrit-lexicon.uni-koeln.de/

Cologne standard IAST extension #392

Open funderburkjim opened 2 years ago

funderburkjim commented 2 years ago

In recent work on the IAST text of the MD dictionary, several improvements were made to the Cologne extension of the IAST standard.
Since transcoding between slp1 and iast is relevant to many dictionaries, some documentation of this work has been added in the iast directory of this repository.

funderburkjim commented 2 years ago

consistency

These two xml transcoding rule files should be kept consistent with similarly named files in:

funderburkjim commented 2 years ago

accent copy-paste

There are sometimes multiple visually indistinguishable unicode representations for Latin letters with diacritics. For example, the letter a with acute (udatta) accent can be represented either by

Whenever iast in a text file needs to be corrected, it is advisable to copy-paste the characters from the slp1_iast.txt file. This practice helps keep the coding consistent across the various dictionaries.
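To illustrate why copy-pasting from a single reference file helps, here is a minimal Python sketch using only the standard `unicodedata` module. It compares the two commonly encountered encodings of a with acute accent (precomposed and base letter plus combining mark). The helper name `canonically_equal` is hypothetical and not part of this repository, and the sketch makes no claim about which form slp1_iast.txt actually uses.

```python
import unicodedata

# Two encodings that render identically as "á".
precomposed = "\u00e1"   # LATIN SMALL LETTER A WITH ACUTE
decomposed = "a\u0301"   # LATIN SMALL LETTER A + COMBINING ACUTE ACCENT


def canonically_equal(s, t):
    """Hypothetical helper: compare two strings after NFC normalization."""
    return unicodedata.normalize("NFC", s) == unicodedata.normalize("NFC", t)


# A plain string comparison treats the two forms as different ...
print(precomposed == decomposed)
# ... because they differ in the number of code points ...
print(len(precomposed), len(decomposed))
# ... but they agree once both are normalized to the same form.
print(canonically_equal(precomposed, decomposed))
```

A plain-text search for one form will miss the other, which is why mixed encodings in a dictionary file can be hard to spot by eye.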

gasyoun commented 2 years ago

Cologne extension to IAST standard

Is there a page dedicated to it on the Cologne website itself, @funderburkjim ?

Andhrabharati commented 2 years ago

H\ ḥ̀ ( Ḥ̀ ) \u1e25\u0300 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING GRAVE ACCENT
H/ ḥ́ ( Ḥ́ ) \u1e25\u0301 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING ACUTE ACCENT
H^ ḥ̂ ( Ḥ̂ ) \u1e25\u0302 LATIN SMALL LETTER H WITH DOT BELOW + COMBINING CIRCUMFLEX ACCENT
M\ ṃ̀ ( Ṃ̀ ) \u1e43\u0300 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING GRAVE ACCENT
M/ ṃ́ ( Ṃ́ ) \u1e43\u0301 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING ACUTE ACCENT
M^ ṃ̂ ( Ṃ̂ ) \u1e43\u0302 LATIN SMALL LETTER M WITH DOT BELOW + COMBINING CIRCUMFLEX ACCENT
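The six entries listed above can be checked mechanically. The following is a rough Python sketch, not code from this repository: the dictionary `SLP1_ACCENTED` and the function `describe` are hypothetical names, and the code point values are transcribed from the entries above. It uses the standard `unicodedata` module to print the Unicode character names for each sequence.

```python
import unicodedata

# SLP1 code -> IAST code point sequence, transcribed from the entries above.
# (Hypothetical name; not taken from the repository's transcoder files.)
SLP1_ACCENTED = {
    "H\\": "\u1e25\u0300",
    "H/": "\u1e25\u0301",
    "H^": "\u1e25\u0302",
    "M\\": "\u1e43\u0300",
    "M/": "\u1e43\u0301",
    "M^": "\u1e43\u0302",
}


def describe(s):
    """Join the Unicode names of the code points in s with ' + '."""
    return " + ".join(unicodedata.name(ch) for ch in s)


for slp1, iast in SLP1_ACCENTED.items():
    # Show the SLP1 code, the rendered IAST string, and its code point names.
    print(slp1, iast, describe(iast))
```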

@funderburkjim

After some prolonged discussions just about 6 months back (https://github.com/sanskrit-lexicon/PWG/issues/5#issuecomment-894630351 and https://github.com/sanskrit-lexicon/PWG/issues/5#issuecomment-896184204), I thought you'd zeroed in on having accents before visarga & anusvara.

Did you subsequently change your mind and decide to keep the accents after them?