paddymcall / SARIT

*Old* repository of the SARIT corpus
https://github.com/sarit/SARIT-corpus

Mbh-Roman version: transliteration failures #7

Closed: wujastyk closed this issue 11 years ago

wujastyk commented 11 years ago

The string "Failure:no transliteration for Devanagari" occurs 486 times in mahabharata-roman.xml.
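A count like this can be reproduced with a few lines of Python. This is only a minimal sketch: the helper name count_failures is hypothetical, and the sample string stands in for the real file contents.

```python
# Marker text as quoted in this issue.
MARKER = "Failure:no transliteration for Devanagari"


def count_failures(text, marker=MARKER):
    """Return how many times the failure marker occurs in text."""
    return text.count(marker)


# Stand-in for the real file contents (hypothetical sample).
sample = (
    "<l>Failure:no transliteration for Devanagari</l>"
    "<l>dharmakṣetre kurukṣetre</l>"
    "<l>Failure:no transliteration for Devanagari</l>"
)

print(count_failures(sample))  # 2
```

To run it against the actual file, read mahabharata-roman.xml into a string first (assuming it is UTF-8 encoded) and pass that string to count_failures.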

It's usually (always?) u'\u094d' or u'\u093c' that causes the problem.
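For reference, the two code points can be identified with Python's standard unicodedata module. This is just a quick lookup, not part of the SARIT tooling:

```python
import unicodedata

# The two code points named in the failure messages.
suspects = ["\u094d", "\u093c"]

# Map each character to its official Unicode name.
names = {ch: unicodedata.name(ch) for ch in suspects}

for ch, name in names.items():
    print(f"U+{ord(ch):04X} {name}")
# U+094D DEVANAGARI SIGN VIRAMA
# U+093C DEVANAGARI SIGN NUKTA
```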

From the sample screenshots below, I think these may be Devanāgarī input issues concerning virāmas.

Screenshot from 2012-12-19 03:11:21 Screenshot from 2012-12-19 03:12:53

paddymcall commented 11 years ago

Resolved these failures: U+094D is the virāma (now output as a space), and U+093C is the nukta (still output as a nukta).
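The behaviour described here could look roughly like the following. This is a hypothetical sketch, not the code actually used for the fix: the function name handle_special_signs is invented for illustration, and it assumes the stray signs are handled as a separate step from the main transliteration.

```python
VIRAMA = "\u094d"  # DEVANAGARI SIGN VIRAMA
NUKTA = "\u093c"   # DEVANAGARI SIGN NUKTA


def handle_special_signs(text):
    """Turn each virama into a space; leave any nukta untouched."""
    return text.replace(VIRAMA, " ")


# A consonant followed by a virama: the virama becomes a space.
print(repr(handle_special_signs("\u0915" + VIRAMA)))

# A consonant followed by a nukta: returned unchanged.
print(repr(handle_special_signs("\u091c" + NUKTA)))
```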