vidyut-lipi Incorrect Transliteration of `ch` to `च` and `chh` to `छ` from Harvard-Kyoto to Devanagari

ambuda-org / vidyut

Infrastructure for Sanskrit software. For Python bindings, see `vidyut-py`.

49 stars 21 forks source link

vidyut-lipi Incorrect Transliteration of `ch` to `च` and `chh` to `छ` from Harvard-Kyoto to Devanagari #110

Closed CheeksTheGeek closed 7 months ago

CheeksTheGeek commented 7 months ago

Description

The transliteration of certain words from the Harvard-Kyoto system to the Devanagari script is incorrect. Specifically, the words "chaitanya" and "chhatra" are not being converted accurately.

Context

Since I'm a daily user of the English to Hindi iOS and MacOS Transliteration Keyboards, I compared my usage from those systems to this, I hope putting this issue doesn't feel demanding since the project is new and even though it's growing fast, but my usage is a bit niche so I understand if it might be on a lower priority.

Input

The input given in Harvard-Kyoto was:

chaitanya
chhatra

Expected Output

The expected Devanagari transliteration should be:

चैतन्य (for chaitanya)
छात्र (for chhatra)

Actual Output

The actual output produced by the system was:

छैतन्य (for chaitanya)
छ्हत्र (for chhatra)

akprasad commented 7 months ago

@CheeksTheGeek thanks for filing this issue!

Are you sure your input encoding is Harvard-Kyoto? Harvard-Kyoto maps c to च and ch to छ:

https://en.wikipedia.org/wiki/Harvard-Kyoto#Consonants

Likewise, chhatra is not Harvard-Kyoto for छात्र because in Harvard-Kyoto, the long vowel needs to be explicit, as in chAtra.

It seems that your keyboard is more of an IME that accepts an approximate spelling then makes the likeliest suggestion. In comparison, a transliterator like vidyut-lipi is usually less intelligent and tries to make an "exact" conversion by applying specific rules.

CheeksTheGeek commented 7 months ago

@CheeksTheGeek thanks for filing this issue!

Are you sure your input encoding is Harvard-Kyoto? Harvard-Kyoto maps c to च and ch to छ:

https://en.wikipedia.org/wiki/Harvard-Kyoto#Consonants

Likewise, chhatra is not Harvard-Kyoto for छात्र because in Harvard-Kyoto, the long vowel needs to be explicit, as in chAtra.

It seems that your keyboard is more of an IME that accepts an approximate spelling then makes the likeliest suggestion. In comparison, a transliterator like vidyut-lipi is usually less intelligent and tries to make an "exact" conversion by applying specific rules.

I'll read up on Harvard-Kyoto, I'm completely new to Transliterations and I always thought Harvard-Kyoto and ITRANS were the same thing when I came across these terms online, really sorry for putting this issue, I'll close it now

akprasad commented 7 months ago

Nothing to apologize for! Thanks for your enthusiasm, and please file more issues in the future.

CheeksTheGeek commented 7 months ago

Yupp, looking forward to exploring it!