A search for the regexp \S\x{0964} gave 27756 hits.
A search for the regexp \s\x{0964} gave 1855 hits.
NB. These results relate to my fork of the repo after my commits to the master branch.
This prompts the question:
Should there be a space before the Devanagari Danda?
In the Assamese Bible, those with a space are in a minority, being less than 6.3% of the total.
cf. In the Punjabi Bible, the results are quite the opposite!
A search for the regexp \S\x{0964} gave 1547 hits.
A search for the regexp \s\x{0964} gave 21326 hits.
Here, those without a space are in a minority, being less than 6.8% of the total.
What is the typographical standard in this matter for the various languages that use an Indic script?
NB. If some sort of space is required before the Danda, it's conceivable that it should be U+2008 PUNCTUATION SPACE rather than an ordinary space.
If you can provide a definitive answer to the main question, I can readily run a simple bespoke TextPipe filter on the USFM files to ensure that all the occurrences of the Devanagari Danda use the same style.
A search for the regexp
\S\x{0964}
gave 27756 hits. A search for the regexp\s\x{0964}
gave 1855 hits.NB. These results relate to my fork of the repo after my commits to the master branch.
This prompts the question:
In the Assamese Bible, those with a space are in a minority, being less than 6.3% of the total.
cf. In the Punjabi Bible, the results are quite the opposite! A search for the regexp
\S\x{0964}
gave 1547 hits. A search for the regexp\s\x{0964}
gave 21326 hits. Here, those without a space are in a minority, being less than 6.8% of the total.What is the typographical standard in this matter for the various languages that use an Indic script?
NB. If some sort of space is required before the Danda, it's conceivable that it should be
U+2008 PUNCTUATION SPACE
rather than an ordinary space.