PactInteractive / image-downloader

Download images from the web more easily. A browser extension for Google Chrome, Microsoft Edge, and Brave.
https://chrome.google.com/webstore/detail/image-downloader/cnpniohnfphhjihaiiggeabnkjhpaldj

Can't properly pull images from archive.org .pdf #116

Open iconoclasthero opened 1 year ago

iconoclasthero commented 1 year ago

Setup

Describe the bug

I'm trying to download images from a [borrowed] PDF book on archive.org (URL below). The extension doesn't load all of the images (not a surprise, but not desirable), and when I select the few images that do populate and click download, it tries to save them as .txt. In the attached screenshot, I clicked the download arrow and it popped up with a .txt filename.

URL

(https://archive.org/details/lostchanceinchin0000serv/page/9/mode/1up)

Screenshots

Screenshot from 2023-11-08 10-51-31

I would really like to be able to scrape the entire book in one go so I can run it through tesseract > piper > ffmpeg and end up with an Opus audiobook.
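
The tesseract > piper > ffmpeg chain mentioned above could look roughly like the following for a single downloaded page image. This is only a minimal sketch, not the poster's actual script: `plan_page`, `run_page`, the file layout, the 32k bitrate, and the voice model name are all hypothetical, and it assumes the `tesseract`, `piper`, and `ffmpeg` executables are on the PATH. It only builds the command lines; nothing runs until `run_page` is called.

```python
from pathlib import Path
from typing import List, NamedTuple, Optional
import subprocess


class Step(NamedTuple):
    argv: List[str]              # command line to execute
    stdin_file: Optional[Path]   # file fed to the command's stdin, if any


def plan_page(image: Path, out_dir: Path, voice_model: str) -> List[Step]:
    """Build the OCR -> TTS -> Opus commands for one page image (hypothetical helper)."""
    base = out_dir / image.stem          # e.g. out/page_0009
    text = Path(f"{base}.txt")           # tesseract appends .txt to the output base
    wav = Path(f"{base}.wav")
    opus = Path(f"{base}.opus")
    return [
        # OCR: tesseract <image> <output base> writes <output base>.txt
        Step(["tesseract", str(image), str(base)], None),
        # TTS: piper reads text on stdin and writes a WAV file
        Step(["piper", "--model", voice_model, "--output_file", str(wav)], text),
        # Encode: ffmpeg converts the WAV to Opus (bitrate is an assumption)
        Step(["ffmpeg", "-y", "-i", str(wav), "-c:a", "libopus", "-b:a", "32k", str(opus)], None),
    ]


def run_page(steps: List[Step]) -> None:
    """Run the planned steps in order, stopping at the first failure."""
    for step in steps:
        if step.stdin_file is None:
            subprocess.run(step.argv, check=True)
        else:
            with step.stdin_file.open("rb") as fh:
                subprocess.run(step.argv, stdin=fh, check=True)


# Example (the voice model file name is an assumption):
# run_page(plan_page(Path("scans/page_0009.png"), Path("out"), "en_US-lessac-medium.onnx"))
```

Looping `plan_page` over every page image would cover the whole book, which is why getting all pages out in one download matters here.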

iconoclasthero commented 1 year ago

NB: clicking download also breaks the book "loan" on archive.org, so I have to borrow it again.