alexgand / springer_free_books

Python script to download all Springer books released for free during the 2020 COVID-19 quarantine
GNU General Public License v3.0
1.64k stars 366 forks source link

Error: probably not a valid book #113

Open michalastocki opened 4 years ago

michalastocki commented 4 years ago

Hello, do you have any ideas why it's not downloading any of the books? I have all the dependencies and the python code seems to be running, but ot doesn't download anything right now, skipping every book.

Does it mean that Springer recovered all the books from their site?

Thanks!

image

wallacefsilva commented 4 years ago

I'm having the same issues. It seems they implemented a security check via recaptcha...

renanxcortes commented 4 years ago

Yep.. it seems I'm facing the same type of issue in the R version of this package (https://github.com/renanxcortes/springerQuarantineBooksR/issues/53). Any clue on how to solve this issue?

lgabs commented 4 years ago

I'm getting the same errors:

image

renanxcortes commented 4 years ago

It seems like springer tweaked the reCaptcha step and the recent workaround is not enough to avoid getting flawed files :(

chaosAD commented 4 years ago

I have a raw working prototype to get around the latest hurdle. My code also manipulates the HTTP cookies to get past it, but in an elaborate way. I see Springer put other cookies that are used to track downloading users as well. This makes it easier for their monitoring software to analyze and then flag a downloading bot. Obviously, they do not allow downloading using bots and I feel we are playing cat and mouse with Springer. For that reason, I am not going to pursue on this.

SanJJ1 commented 4 years ago

So does this mean that this project is dead?

pbl987 commented 3 years ago

I have a raw working prototype to get around the latest hurdle. My code also manipulates the HTTP cookies to get past it, but in an elaborate way. I see Springer put other cookies that are used to track downloading users as well. This makes it easier for their monitoring software to analyze and then flag a downloading bot. Obviously, they do not allow downloading using bots and I feel we are playing cat and mouse with Springer. For that reason, I am not going to pursue on this.

There is no need for playing cat and mouse - JDownloader does that! They can solve automatically weak captchas, and semi-automatically with google.

I want to save at least the covid-package, as this wont be available in two weeks!!! Currently 389 books are available, it would be doable to enter a captcha every 5 download or so.

Could you please reconsider helping the project?