fatty- / daisy-pipeline

Automatically exported from code.google.com/p/daisy-pipeline

split large text files by size #351

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Some producers have existing content that consists of a single very large
(3 MB+) text content document (XHTML or DTBook format). Our conversion scripts
should offer an option to split such a document into several smaller files in
the EPUB output, because several smaller text files improve performance
dramatically in reading systems.

The html-utils module contains an XSLT that splits an XHTML document based on
its structure, but it would be nice to also have an option to split the text
content document based on its size in KB.
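
To make the request concrete, here is a rough sketch of size-based splitting. It is written in Python with the standard library only and is not the html-utils XSLT or any existing Pipeline code. The function name `split_by_size` and the `max_bytes` parameter are hypothetical. It assumes the input is well-formed XHTML without a DOCTYPE or undefined entities, and it only splits between top-level children of `<body>`, so a single oversized child still ends up in a chunk of its own.

```python
import copy
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"
# Serialize XHTML elements with a default namespace instead of an "ns0:" prefix.
ET.register_namespace("", XHTML_NS)


def split_by_size(xhtml, max_bytes):
    """Split an XHTML string into several XHTML strings (hypothetical helper).

    Top-level children of <body> are grouped greedily so that each group's
    serialized size stays at or below max_bytes where possible. Sizes are
    approximate: each child is measured when serialized on its own.
    """
    root = ET.fromstring(xhtml)
    body = root.find(f"{{{XHTML_NS}}}body")
    if body is None:
        raise ValueError("no XHTML <body> element found")

    children = list(body)
    if not children:
        return [xhtml]

    # Greedy grouping: start a new group when the budget would be exceeded.
    groups, current, size = [], [], 0
    for child in children:
        child_size = len(ET.tostring(child, encoding="utf-8"))
        if current and size + child_size > max_bytes:
            groups.append(current)
            current, size = [], 0
        current.append(child)
        size += child_size
    groups.append(current)

    # Empty the original body so the remaining tree (head, attributes)
    # can serve as a cheap template for every output document.
    for child in children:
        body.remove(child)

    documents = []
    for group in groups:
        shell = copy.deepcopy(root)
        shell.find(f"{{{XHTML_NS}}}body").extend(group)
        documents.append(ET.tostring(shell, encoding="unicode"))
    return documents
```

A real implementation would also have to rewrite internal links and the EPUB navigation and spine so they point at the new files, which this sketch does not attempt.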

See also issue 309:
https://code.google.com/p/daisy-pipeline/issues/detail?id=309

Original issue reported on code.google.com by rdeltour@gmail.com on 17 Jul 2013 at 9:44