saibotsivad closed 6 years ago
The space is missing in the intermediate JSON form, too, so the space must have been lost during the HTML parsing.
Okay, I think I finally got everything tracked down. I pushed up a bunch of commits that fix some general whitespace issues.
Stray whitespace elements are now processed, and inside stanzas there are now separate "line text" and "line break" objects instead of just "line" objects. The old data model was insufficient because there are plenty of places where a verse in a stanza spans multiple lines, like in Deborah's song.
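To make the new shape concrete, here is a rough sketch of how a verse that spans two lines could look with separate "line text" and "line break" objects. Only those two type names come from this thread; the `verseNumber` and `value` fields, the helper names, and the placeholder text are assumptions for illustration, not the published 1.0.0 format.

```python
from typing import Dict, List

# Hypothetical stanza: verse 3 spans two lines, verse 4 fits on one.
# Field names ("verseNumber", "value") are assumed, not taken from the repo.
stanza = [
    {"type": "line text", "verseNumber": 3, "value": "first line of verse 3,"},
    {"type": "line break"},
    {"type": "line text", "verseNumber": 3, "value": "second line of verse 3."},
    {"type": "line break"},
    {"type": "line text", "verseNumber": 4, "value": "only line of verse 4."},
    {"type": "line break"},
]


def lines_by_verse(chunks: List[dict]) -> Dict[int, List[str]]:
    """Group the text of each line under its verse number."""
    verses: Dict[int, List[str]] = {}
    for chunk in chunks:
        if chunk["type"] == "line text":
            verses.setdefault(chunk["verseNumber"], []).append(chunk["value"])
    return verses


def render(chunks: List[dict]) -> str:
    """Rebuild the stanza as plain text, one newline per "line break"."""
    parts: List[str] = []
    for chunk in chunks:
        if chunk["type"] == "line text":
            parts.append(chunk["value"])
        elif chunk["type"] == "line break":
            parts.append("\n")
    return "".join(parts)
```

With a single "line" object per verse there would be no place to record the break inside verse 3. Splitting text and breaks into separate objects lets one verse own several lines.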
The changes have been published as 1.0.0.
Thanks! ❤️
Looking at James 1:21-22, the output JSON doesn't have a space at the end of verse 21 like it should: https://github.com/TehShrike/world-english-bible/blob/master/json/james.json#L193
The raw HTML of that section looks like: