openzim / gutenberg

Scraper for downloading the entire ebooks repository of project Gutenberg
https://download.kiwix.org/zim/gutenberg
GNU General Public License v3.0
127 stars 37 forks source link

Scraper now dies #51

Closed kelson42 closed 6 years ago

kelson42 commented 7 years ago

$gutenberg2zim -l en --dlc=1

[pdf] not avail. for #12506# Critiques and Addresses
                html already exists at dl-cache/12506.html
        Downloading content files for Book #12507
                epub already exists at dl-cache/12507.epub
[pdf] not avail. for #12507# The History of the Rise, Progress and Accomplishment of the Abolition of the African Slave Trade by the British Parliament (1808)
                html already exists at dl-cache/12507.html
        Downloading content files for Book #12508
                epub already exists at dl-cache/12508.epub
[pdf] not avail. for #12508# The Meaning of Good—A Dialogue
                html already exists at dl-cache/12508.html
        Downloading content files for Book #12509
                epub already exists at dl-cache/12509.epub
[pdf] not avail. for #12509# The Moon Rock
                html already exists at dl-cache/12509.html
        Downloading content files for Book #12510
                epub already exists at dl-cache/12510.epub
[pdf] not avail. for #12510# Targum
                html already exists at dl-cache/12510.html
        Downloading content files for Book #12511
                epub already exists at dl-cache/12511.epub
[pdf] not avail. for #12511# Blackwood's Edinburgh Magazine — Volume 53, No. 332, June, 1843
                html already exists at dl-cache/12511.html
        Downloading content files for Book #12512
                epub already exists at dl-cache/12512.epub
Segmentation fault (core dumped)
kelson42 commented 7 years ago

By restarting the script, it always dies, but not exactly on the same book.

kelson42 commented 6 years ago

This seems to be a problem with python2, we need to use python3.