faustedition / faust-gen-html

Pipelines to generate HTML for the Faust edition's reading texts and prints.
1 stars 4 forks source link

Inter-element whitespace lost #543

Closed gerritbruening closed 6 years ago

gerritbruening commented 6 years ago

As reported earlier:

                                    <rdg wit="faust://document/faustedition/S" type="type_2a*"
                                            ><emph>liebte dich</emph>
                                        <wit wit="faust://document/faustedition/S">S</wit></rdg>

leads to

                  <CharacterStyleRange AppliedCharacterStyle="CharacterStyle/Sperrung">
                     <Content>liebte dich</Content>
                  </CharacterStyleRange>
                  <CharacterStyleRange AppliedCharacterStyle="CharacterStyle/Kursiv">
                     <Content>S</Content>
                  </CharacterStyleRange>

with to words erroneously lumped together. This may also appear in Faust II. Is there a quick fix are a good search pattern I can use to spot possible cases?

pglatza commented 6 years ago

It is because of the missing / wrong whitespace normalisation before revision 79 (see also https://github.com/faustedition/faust-gen-html/issues/532#issuecomment-398404891).

Currently, with latest commit (=r79), there are two whitespace characters between <emph>liebte dich</emph> and S.

pglatza commented 6 years ago

The two spaces are from xml input. I do not add extra whitespace characters.

gerritbruening commented 6 years ago

Hier wird nichts mehr geschehen.