FreeBiblesIndia / Assamese_Bible

Assamese Bible (অসমীয়া বাইবেল). This work is made available under a Creative Commons Attribution-ShareAlike 4.0 International License.
http://www.freebiblesindia.in/bible/asm/
Other
3 stars 4 forks source link

Should there be a space before the Devanagari Danda? #18

Open DavidHaslam opened 7 years ago

DavidHaslam commented 7 years ago

A search for the regexp \S\x{0964} gave 27756 hits. A search for the regexp \s\x{0964} gave 1855 hits.

NB. These results relate to my fork of the repo after my commits to the master branch.

This prompts the question:

Should there be a space before the Devanagari Danda?

In the Assamese Bible, those with a space are in a minority, being less than 6.3% of the total.

cf. In the Punjabi Bible, the results are quite the opposite! A search for the regexp \S\x{0964} gave 1547 hits. A search for the regexp \s\x{0964} gave 21326 hits. Here, those without a space are in a minority, being less than 6.8% of the total.

What is the typographical standard in this matter for the various languages that use an Indic script?

NB. If some sort of space is required before the Danda, it's conceivable that it should be U+2008 PUNCTUATION SPACE rather than an ordinary space.

DavidHaslam commented 7 years ago

If you can provide a definitive answer to the main question, I can readily run a simple bespoke TextPipe filter on the USFM files to ensure that all the occurrences of the Devanagari Danda use the same style.