Open kshawkin opened 2 years ago
The Note is correct and the encoding is incorrect. It should be <lb break="no"/> when the line break is inside a word.
I just checked the Guidelines on <pc> and, again, the encoded example is backwards. The @force attribute is "strong" when the punctuation mark is a word separator, and "weak" when it is not. In this case, the hyphen appears inside the word "character", so it does not serve as a word separator.
I think it should be:
char<pc force="weak">-</pc><lb break="no"/>acter
Also, we might try to force a line break in the rendition at the point where the hyphen appears in the source document, so the hyphen doesn't look odd.
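To make the difference concrete, here is a rough sketch of how a downstream processor might flatten the two encodings to reading text. It is only an illustration under my own assumptions: the function name plain_text is made up, the fragment is parsed without the TEI namespace, and the rule "drop a weak <pc> that is directly followed by <lb break="no"/>" is one plausible processing choice, not something the Guidelines mandate.

```python
import xml.etree.ElementTree as ET


def plain_text(fragment: str) -> str:
    """Flatten a small TEI fragment to reading text (hypothetical helper).

    Assumptions: no TEI namespace, and only <pc> and <lb> children
    appear directly inside the fragment.
    """
    # Wrap the fragment so it parses as a single well-formed element.
    root = ET.fromstring(f"<seg>{fragment}</seg>")
    out = [root.text or ""]
    children = list(root)
    for i, el in enumerate(children):
        nxt = children[i + 1] if i + 1 < len(children) else None
        if el.tag == "pc":
            # A weak hyphen directly followed by <lb break="no"/> is
            # treated as end-of-line hyphenation and dropped.
            before_nobreak = (
                not (el.tail or "").strip()
                and nxt is not None
                and nxt.tag == "lb"
                and nxt.get("break") == "no"
            )
            if not (el.get("force") == "weak" and before_nobreak):
                out.append(el.text or "")
        elif el.tag == "lb":
            # Any <lb> other than break="no" is treated as a word boundary.
            if el.get("break") != "no":
                out.append(" ")
        out.append(el.tail or "")
    return "".join(out)


current = ('UTF-8 is a char<pc force="strong">-</pc><lb break="yes"/>'
           'acter encoding for Unicode.')
proposed = ('UTF-8 is a char<pc force="weak">-</pc><lb break="no"/>'
            'acter encoding for Unicode.')

print(plain_text(current))
print(plain_text(proposed))
```

With the encoding currently in the table, this sketch leaves the word split as "char- acter"; with the proposed encoding it rejoins it as "character", which is what the Note describes.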
Thank you for the quick detective work!
In the third row of our table, we have:
UTF-8 is a char<pc force="strong">-</pc><lb break="yes"/>acter encoding for Unicode.
The code sample involves force="strong" and break="yes", but the note implies that it has force="weak" and break="no". It's been too long since I thought about any of this, so I'm not even sure what is correct here. I vaguely recall that @sydb wrote this section?