Open RavanJAltaie opened 1 year ago
Recipe Created https://farm.openzim.org/recipes/bibebook.com_fr_all
@rgaudin the scraper here started 10 days ago, is that normal? Shall I wait longer?
You should take a look at the website before creating the recipe.
Search engines should always be excluded (except for client-side search engines) because they generate tons of useless, mostly duplicated pages.
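To illustrate the point, here is a minimal sketch of filtering search-result URLs out of a crawl queue. The pattern and the function names `is_search_url` and `filter_crawl_queue` are hypothetical: the actual search URLs of a given website have to be checked by hand, and the same idea would normally be expressed as an exclusion regex in the recipe rather than in Python.

```python
import re

# Hypothetical pattern: matches a "/search" path segment or a common
# search query parameter. Real sites need their own, verified pattern.
SEARCH_URL_PATTERN = re.compile(
    r"(/search(/|\?|$)|[?&](q|s|search|query)=)",
    re.IGNORECASE,
)


def is_search_url(url: str) -> bool:
    """Return True if the URL looks like a server-side search page."""
    return SEARCH_URL_PATTERN.search(url) is not None


def filter_crawl_queue(urls):
    """Keep only the URLs that do not look like search pages."""
    return [url for url in urls if not is_search_url(url)]
```

Each distinct query string is a distinct page to a crawler, which is why a single search box can multiply the number of pages almost without bound.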
@rgaudin you are correct, it didn't start for 20 days, and it will not work out with this scraper. I'll keep an eye out for any websites with a search engine.
A custom scraper seems pretty easy to write, and the website no longer appears to be updated.
If one goes to http://www.bibebook.com/files/download/zip/packs_classiques/, one can see that the packs have not been updated since 2016.
The whole catalog is available at http://www.bibebook.com/files/download/catalogues/Bibebook-library.uni in XML format.
Scraper can hence simply:
Good way to learn how to write a scraper tbh.
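As a starting point for such a scraper, here is a rough sketch of reading the XML catalog with the Python standard library. The catalog's schema is not described in this thread, so the code makes no assumption about it: `summarize_catalog` only counts tags to help discover the structure, and `extract_records` takes the record tag as a parameter. Both function names are hypothetical, and the `book` tag in the usage note is only an illustration.

```python
import xml.etree.ElementTree as ET

# Catalog URL quoted earlier in this thread.
CATALOG_URL = (
    "http://www.bibebook.com/files/download/catalogues/Bibebook-library.uni"
)


def summarize_catalog(xml_text):
    """Count how often each element tag occurs, to discover the schema."""
    root = ET.fromstring(xml_text)
    counts = {}
    for element in root.iter():
        counts[element.tag] = counts.get(element.tag, 0) + 1
    return counts


def extract_records(xml_text, record_tag):
    """Flatten every <record_tag> element into a dict.

    Attributes and the text of direct children are both collected;
    deeper nesting is ignored in this sketch.
    """
    root = ET.fromstring(xml_text)
    records = []
    for node in root.iter(record_tag):
        record = dict(node.attrib)
        for child in node:
            record[child.tag] = (child.text or "").strip()
        records.append(record)
    return records
```

The catalog itself could be downloaded once with `urllib.request.urlopen(CATALOG_URL).read()` and passed to `summarize_catalog`; once the real record tag is known, something like `extract_records(data, "book")` would give one dict per ebook.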
As part of scouting Grey Box content, we need to create a ZIM file for the details below:
Website URL: https://www.bibebook.com/
License: Creative Commons
Desired ZIM Title: Bibe Books
Desired ZIM Description: 1700 ebooks gratuits
Desired ZIM Icon –png (URL or attach one):
Language (ISO 639-3): fra
Is this a MediaWiki?: no