openzim / gutenberg

Scraper for downloading the entire ebooks repository of project Gutenberg
https://download.kiwix.org/zim/gutenberg
GNU General Public License v3.0
128 stars 37 forks source link

--keep-db should be replaced #15

Closed kelson42 closed 7 years ago

kelson42 commented 9 years ago

Parsing all the RDF is a really long process, even on a good computer. The default behaviour should not to remove and reparse everything.

If no action is given (--parse, --download, --export, --do-everything...) then all the steps should be done and then the script work in update mode, that means that only missing books/RDF are parsed. If --books is specified, only the corresponding books are parsed.

The use should still have the ability to erase the db and we should provide a new action called --erase-db