openzim / zimit

Make a ZIM file from any Web site and surf offline!
GNU General Public License v3.0
330 stars 24 forks source link

zimit refuses to pull pdf files stored on wordpress site #81

Closed Popolechien closed 3 years ago

Popolechien commented 3 years ago

Ran a zimit session of https://educate.gov.jm/index.php/resources/ The page is pretty straitghtforward inasmuch as it is a wordpress site with a list of links. Each link is to a pdf file.

Zimit copied the landing page (here is the recipe) but failed to collect the PDF files behing the links (only returned net::ERR_FAILED)

kelson42 commented 3 years ago

@ikreymer Sounds like a problem with the spider. Would you be able to confirm?

kelson42 commented 3 years ago

@rgaudin Can we link this please with an upstream ticket?

rgaudin commented 3 years ago

This is eventually a recipe mistake as the PDF files, while on the same domain name, are below the home page and thus the default scope:

You can fix the recipe by setting the --scope param to https://educate.gov.jm/