sile-typesetter / sile

The SILE Typesetter — Simon’s Improved Layout Engine
https://sile-typesetter.org
MIT License
1.62k stars 97 forks source link

Fix french handling of U+00A0 (nbsp) #1949

Closed Omikhleia closed 6 months ago

Omikhleia commented 6 months ago

Closes #1948

Interestingly, this was covered by test feat-unicode-nbsp: the PDF result looks the same, but in debug it does show something was off.

Additional checks: (In progress, that's an amount of pages to check ;) Re-processing the SBL and the NCL from https://github.com/Freely-Given-org/BibleTypesetter/pull/3 -- both have U+00A0 in their input.

Omikhleia commented 6 months ago

NCL and SBL :heavy_check_mark:

I added a commit for removing duplicated nbsp -- Those occurred quite often (inconsistently, and fairly often before high punctuation signs and colons) in both these documents... As noted in the commit comment, there could be other places where a typist used multiple nbsp to really create some sort of indentation, but this is not a good typographical practice, so I kept it simple and went for mere removal: whether nbsp is supposed to be non-collapsing or not is a matter of debate, clearly, but this is ill-defined.