Closed martinpub closed 2 years ago
Thanks for the report. It seems XProc does not automatically add the doctype when storing documents and it is currently not possible to set the "html-version" serialization parameter.
Thanks for the fix @bertfrees!
Expected Behavior
epub3-to-epub3 preserves doctype from XHTML documents in input EPUB 3 fileset.
Actual Behavior
epub3-to-epub3 strips doctype from XHTML documents in input EPUB 3 fileset.
Steps to Reproduce
<!DOCTYPE html>
dp2 epub3-to-epub3 --source source.epub --data source.zip --output outputdir/ --tts false --braille false --sentence-detection false --update-lang-attributes true --update-identifier-in-content-docs true --update-title-in-content-docs true --metadata sample_metadata.xml
Details
As XHTML 5.0 is already explicitly specified in EPUB 3, I'm not sure if the HTML 5 doctype is strictly needed. However, I was uncertain if this can cause errors in certain reading systems/processing tools that might rely on HTML 5 parsing?
Environment
Logs
Logs