internetarchive / epub

For code related to making ePub files
40 stars 3 forks source link

Error processing lovemelittlelov04readgoog #16

Open mikemccabe opened 14 years ago

mikemccabe commented 14 years ago

Thanks QuickReader for spotting this one.

PATH=/sw/bin:$PATH /usr/bin/python ../epub/convert_iabook.py --epub -d lovemelittlelov04readgoog /Users/mccabe/s/archive/epub/epub.py:6: UserWarning: This version of libxml2 has a known XPath bug. Use it at your own risk. from lxml import etree pamcut: You have specified a right edge (2560) that is beyond the right edge of the image (2559) pnmscale: Error reading magic number from Netpbm image stream. Most often, this means your input file is empty. pnmtojpeg: Error reading magic number from Netpbm image stream. Most often, this means your input file is empty. ERROR: lovemelittlelov04readgoog.epub: image file OEBPS/images/picture1.jpg is too short

Check finished with warnings or errors!

Epubcheck Version 1.0.3

tfmorris commented 8 years ago

This processes without generating errors now, but because the title and copyright page identification failed for _scandata.xml, the first page is lost because it's tagged as a title page, which only gets processed as an image, not text.

Whatever tool generates the _scandata file needs to be fixed to correct this issue (or the epub tool needs to implement its own page layout analysis and page type identification).