Open DavidHaslam opened 7 years ago
In Unicode we are not limited to choosing between space or no space.
There are also code points for
EM SPACE
EN SPACE
FIGURE SPACE
THREE-PER-EM SPACE
FOUR-PER-EM SPACE
SIX-PER-EM SPACE
HAIR SPACE
PUNCTUATION SPACE
TAG SPACE
THIN SPACE
ZERO WIDTH SPACE
NO-BREAK SPACE
NARROW NO-BREAK SPACE
ZERO WIDTH NO-BREAK SPACE
NB. Many of the above special space characters are not supported by the Raavi font. The are supported by fonts Arial Unicode MS, Code2000 and FreeSans.
In some languages (e.g. French) there are typographical rules regarding the spacing of quotation marks. This makes particular sense when text contains nested quotations. In Gurmukhi, it may be worth considering making use of such a rule.
I have also observed that when there is no space between a punctuation mark and a Punjabi word, the punctuation mark becomes very small for some font engines. This is particularly a problem here in GitHub.
I have just updated my TextPipe filter to implement the following:
Insert a space after ending punctuation marks
Systematic changes restricted to within verse text:
NB. This enhancement does not address the issue of a space before such a punctuation mark.
After the above processing there are still:
Spacing rules need to be reviewed, especially for comma, question mark & exclamation mark. And some of the latter might still be a typo for a danda.
Yet to be analysed:
Space or no space after left quotation mark or before right quotation mark?
Action: Need the statistics for this, and to decide on what type of space (if any) is most appropriate.
Following this further, I just made a simple TextPipe filter to implement this:
Notes:
Summary: It's a real improvement to have such consistency in the way that quotation marks are spaced.
This issue remains open after the merge of pull request #96
Further analysis and discussion is in order.
In the concatenated USFM file, there are:
NB. For the last observation, many of these are where an exclamation mark is present where a vertical line or danda might be expected.
These inconsistencies in spacing of punctuation become apparent when you compare the list with:
Each instance in both lists should be reviewed in the light of what are the standard rules for punctuation and spacing in Gurmukhi orthography.