Open emylonas opened 3 years ago
Update to say this should probably also include ignoring when <gap>
element is followed by <lb break="no"/>
, which often has white space preceding it. For instance, kede0004:
Ϊο<gap reason="lost" unit="character" extent="unknown"/> <lb break="no"/>πτης
.
The white space should always be ignored when adjacent to a
and in this case, it would mean a
Chris and Luke: I will go in and change the <lb break="no"/>
so they don't have spaces around them. That will make things a bit easier for Luke.
Generally, we want to ignore
<gap>
elements. They do not have to be copied. They do have a role in determining word breaks, however -<gap>
, then you should use the spaces to determine word breaks.<w>
element, but with the attribute<w part="y">
.