openzim / gutenberg

Scraper for downloading the entire ebooks repository of project Gutenberg
https://download.kiwix.org/zim/gutenberg
GNU General Public License v3.0

Crash by exporting #46860 #25

Closed: kelson42 closed 9 years ago

kelson42 commented 9 years ago

While exporting only books in Portuguese, I found this bug:

$ rm -rf static/ ; ./dump-gutenberg.py --books=46860 --export
EXPORTING ebooks to static folder (and JSON)
[46860]
Filtered book collection size: 1
Filtered book collection, PDF: 0
Filtered book collection, ePUB: 1
Filtered book collection, HTML: 1
Dumping full_by_popularity.js
Dumping full_by_title.js
Dumping lang_pt_by_popularity.js
Dumping lang_pt_by_title.js
Dumping authors_lang_pt.js
Dumping auth_80_by_popularity.js
Dumping auth_80_by_title.js
Dumping authors.js
Dumping languages.js
Dumping main_languages.js
Exporting Book #46860.
Exporting to static/O Napoleão de Nothing Hill.46860.html
Copying companion file to 46860_image002.jpg
Copying /media/data/gutenberg/static/46860_image002.jpg
Copying companion file to 46860_image008.jpg
Copying /media/data/gutenberg/static/46860_image008.jpg
Copying companion file to 46860_image001.jpg
Copying /media/data/gutenberg/static/46860_image001.jpg
Copying companion file to 46860_cc0.png
Copying /media/data/gutenberg/static/46860_cc0.png
Copying companion file to 46860_image003.jpg
Copying /media/data/gutenberg/static/46860_image003.jpg
Copying companion file to 46860_image007.jpg
Copying /media/data/gutenberg/static/46860_image007.jpg
Copying companion file to 46860_image004.jpg
Copying /media/data/gutenberg/static/46860_image004.jpg
Copying companion file to 46860_image006.jpg
Copying /media/data/gutenberg/static/46860_image006.jpg
Copying companion file to 46860_cover.gif
Copying /media/data/gutenberg/static/46860_cover.gif
Copying companion file to 46860_image055.jpg
Copying /media/data/gutenberg/static/46860_image055.jpg
Copying format file to O Napoleão de Nothing Hill.46860.epub
Creating ePUB at /tmp/tmpAnHaAJ.epub
Traceback (most recent call last):
  File "./dump-gutenberg.py", line 150, in <module>
    main(docopt(help, version=0.1))
  File "./dump-gutenberg.py", line 137, in main
    only_books=BOOKS)
  File "/media/data/gutenberg/gutenberg/export.py", line 155, in export_all_books
    books=books)
  File "/media/data/gutenberg/gutenberg/export.py", line 557, in export_book_to
    archive_name_for(book, format))
  File "/media/data/gutenberg/gutenberg/export.py", line 524, in handle_companion_file
    optimize_epub(src, tmp_epub.name)
  File "/media/data/gutenberg/gutenberg/export.py", line 491, in optimize_epub
    with open(opff, 'r') as fd:
IOError: [Errno 2] No such file or directory: u'/tmp/tmpM3AA0L/46860/content.opf'
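
The traceback ends in optimize_epub trying to open content.opf at /tmp/tmpM3AA0L/46860/content.opf, so the code apparently expects the package file at a particular place inside the extracted archive, and this book does not have it there. One possible way to avoid depending on that layout is to ask the ePUB itself where its package file lives: the container format stores that location in META-INF/container.xml. The sketch below is only an illustration of that idea, not necessarily how the project fixed the bug. find_opf_path is a hypothetical helper that does not exist in export.py, and the fallback directory search is an extra assumption for archives with a missing or broken container.xml.

```python
import os
import xml.etree.ElementTree as ET

# XML namespace used by META-INF/container.xml in the ePUB container format.
CONTAINER_NS = "urn:oasis:names:tc:opendocument:xmlns:container"


def find_opf_path(epub_dir):
    """Return the path of the OPF package file inside an extracted ePUB.

    Hypothetical helper (not part of export.py). Returns None when no
    OPF file can be located.
    """
    container = os.path.join(epub_dir, "META-INF", "container.xml")
    if os.path.isfile(container):
        try:
            root = ET.parse(container).getroot()
        except ET.ParseError:
            root = None
        if root is not None:
            rootfile = root.find(".//{%s}rootfile" % CONTAINER_NS)
            full_path = rootfile.get("full-path") if rootfile is not None else None
            if full_path:
                # full-path is relative to the archive root and uses "/".
                candidate = os.path.join(epub_dir, *full_path.split("/"))
                if os.path.isfile(candidate):
                    return candidate

    # Assumed fallback: take the first *.opf found anywhere in the tree.
    for dirpath, _dirnames, filenames in os.walk(epub_dir):
        for name in sorted(filenames):
            if name.lower().endswith(".opf"):
                return os.path.join(dirpath, name)
    return None
```

With a helper like this, the caller could check for None and skip the optimisation step (or log the book id) instead of crashing the whole export on one unusual ePUB.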