Closed wlpotter closed 1 year ago
@wsalesky Can you help us understand if we need to change the data or change the serialization into HTML?
Here is the code:
And here is the problem, it looks like the new line breaks before and after the TEI:quote elements are becoming spaces inside the quotation marks that we do note want:
Should we edit the data somehow?
Thanks
@davidamichelson @wlpotter which record is this? (Link to github is fine)
It's this record: https://github.com/srophe/caesarea-data/blob/master/data/testimonia/tei/214.xml
And these lines in particular.
Though I think this issue affects most, if not all, records
This is either an issue with BaseX's indentation setting (i.e., it's trying to 'pretty print' the output and introducing meaningful whitespace). Two options:
Note to self (and thanks @wsalesky for the suggestion) setting BaseX's options to not indent the xml output fixed this problem
set serializer indent=no
(this is for the general output of the query)set exporter indent=no
(this is for the output when running update queries that are written to disk)
This issue is actually a data issue.
This will need a find-and-replace; or I'll need to dive back in to troubleshooting whitespace in basex