FreeBiblesIndia / Hindi_Bible

Hindi Bible (हिंदी बाइबिल). This work is made available under a Creative Commons Attribution-ShareAlike 4.0 International License.
http://www.freebiblesindia.in/bible/hin/
Other
3 stars 3 forks source link

67 instances of double Devanagari Sign Anusvara #7

Open DavidHaslam opened 6 years ago

DavidHaslam commented 6 years ago

A search for the regexp \x{0902}{2,} gave 67 hits.

I expect these are all typos and that only a single Devanagari Sign Anusvara was intended in each location.

Double signs would probably be invisible to the naked eye as a consequence of how most smart fonts display. Can you see the double Anusvara in this verse?

\v 22 तब मिस्र के जादूगरों ने भी अपने तंत्र-मंत्रोंं से वैसा ही किया; तौभी फ़िरौन का मन हठीला हो गया, और यहोवा के कहने के अनुसार उसने मूसा और हारून की न मानी।

It's at the end of the eighth word तंत्र-मंत्रोंं.

btw. A search for multiple occurrences of signs Candrabindu or Visarga gave no hits.

Here are the overall counts for these three signs:

U+0901  ँ   18,114  DEVANAGARI SIGN CANDRABINDU
U+0902  ं   80,492  DEVANAGARI SIGN ANUSVARA
U+0903  ः   625 DEVANAGARI SIGN VISARGA

btw. The 67 instances of double Anusvara represent merely 0.083% of the total.

DavidHaslam commented 6 years ago

FIO. The dotted circle placeholder is visible for the extra Anusvara when viewed with these fonts:

DavidHaslam commented 6 years ago

A search for the regexp [\x{093F}-\x{094C}]{2,} gave just 2 hits.

These are only instances of a Devanagari letter with two vowel signs.

Here's the first instance: (in the word थेीं before the second comma) in Luke 4:22 \v 22 और सब ने उसे सराहा, और जो अनुग्रह की बातें उसके मुँह से निकलती थेीं, उनसे अचम्भित हुए; और कहने लगे, "क्या यह यूसुफ का पुत्र नहीं?" \x + लूका. 2:42, भजन. 45:2 \x*

Here's the other one: (in the word लेीं before the second comma) in John 12:13 \v 13 इसलिए उन्होंने खजूर की, डालियाँ लेीं, और उससे भेंट करने को निकले, और पुकारने लगे, "होशाना! धन्य इस्राएल का राजा, जो प्रभु के नाम से आता है।" \x + भजन 118:25,26 \x*

Without knowing which vowel was intended, how can these be fixed?

@joshykurian Please advise.

DavidHaslam commented 6 years ago

With the exception of the above noted instances of two vowel signs, this issue is fixed in pull request #11