Girbons / comics-downloader

tool to download comics and manga in pdf/epub/cbr/cbz from a website
MIT License
456 stars 49 forks source link

Blank docs #75

Closed bashfulrobot closed 3 years ago

bashfulrobot commented 3 years ago

Describe the bug When downloading, I noticed my cbz file was only 22kb. Seemed small. Ran the same command as a PDF to preview on my computer, blank single white page (was the result).

To Reproduce Steps to reproduce the behavior:

./comics-downloader -url=https://readcomiconline.to/Comic/Batman-Superman-2019 -all -range=11-13 -format=pdf

and

./comics-downloader -url=https://readcomiconline.to/Comic/Batman-Superman-2019 -all -range=11-13 -format=cbz

Expected behavior

Readable comics. :-)

Screenshots

Showing the empty PDF

image

Desktop (please complete the following information):

cat -p /etc/lsb-release
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=20.04
DISTRIB_CODENAME=focal
DISTRIB_DESCRIPTION="Ubuntu 20.04.1 LTS"
./comics-downloader -version
comics-downloader version v0.26.0
bashfulrobot commented 3 years ago

ok, the issue is specific to readcomiconline.to. The tool acts as expected with comicextra.com.

Girbons commented 3 years ago

It worked for me, there could be an antibot?

Screenshot 2020-10-31 at 13 13 51

bashfulrobot commented 3 years ago

I'm really not sure. All I know is when I swapped the site, it worked. I can visit the site manually and see everything. So I don't think there is anything related to my IP, etc.

On Sat., Oct. 31, 2020, 5:15 a.m. Alessandro De Angelis, < notifications@github.com> wrote:

It worked for me, there could be an antibot?

[image: Screenshot 2020-10-31 at 13 13 51] https://user-images.githubusercontent.com/5927088/97778926-f2e53600-1b7a-11eb-9e98-fd63acc05a07.png

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/Girbons/comics-downloader/issues/75#issuecomment-719925989, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACSIRIONLG46BAOXRONC2RDSNP5XPANCNFSM4TFP6LOA .

bashfulrobot commented 3 years ago

The other thing I noticed is that there is no progress bar displayed when it does not work. I recorded a GIF to show you what I mean.

_download-cd

Girbons commented 3 years ago

I'm really not sure. All I know is when I swapped the site, it worked. I can visit the site manually and see everything. So I don't think there is anything related to my IP, etc.

something strange is happened then 😄

The other thing I noticed is that there is no progress bar displayed when it does not work. I recorded a GIF to show you what I mean.

The progressbar does not appear if you already have these comics downloaded otherwise there's a problem

bashfulrobot commented 3 years ago

They are not downloaded. Throughout the testing I've been keeping the directory clean between each test. 👍

On Mon., Nov. 2, 2020, 1:45 a.m. Alessandro De Angelis, < notifications@github.com> wrote:

I'm really not sure. All I know is when I swapped the site, it worked. I can visit the site manually and see everything. So I don't think there is anything related to my IP, etc.

something strange is happened then 😄

The other thing I noticed is that there is no progress bar displayed when it does not work. I recorded a GIF to show you what I mean.

The progressbar does not appear if you already have these comics downloaded otherwise there's a problem

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/Girbons/comics-downloader/issues/75#issuecomment-720360346, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACSIRILZVF4LWYQPALLAHDLSNZ5VJANCNFSM4TFP6LOA .

bashfulrobot commented 3 years ago

This seems to be solved with the latest version.

./comics-downloader -version
comics-downloader version v0.26.2

I am not sure what changed there.

This is good since many many comics at comicextra are missing the images (hence my suggestion for an alternate source in my other ticket).

bashfulrobot commented 3 years ago

And after downloading a bunch of stuff, it started again. I wonder if the site detects the hits and then cuts off access for a time...

Between missing issues at comicextra and this potential throttling, I may have to read online. (boo!) haha.

Girbons commented 3 years ago

I wonder if the site detects the hits and then cuts off access for a time...

It must be it, we can try a delay between one request and another to avoid this

Yea there's an antibot after a while 😄

Girbons commented 3 years ago

Hey @bashfulrobot,

I published a pre-release with delay flag.

./comics-downloader -url https://readcomiconline.to/Comic/Batman-Superman-2019 -all -format cbz -delay 500

Let me know if this is useful or not 😄

bashfulrobot commented 3 years ago

It still seems to be an issue. Notice there is no progress bar, and the resulting file size is too small. See the animated GIF. There is a delay, so the animation may not look like it is working, but it is. Just waiting for the activity.

delay-test

Girbons commented 3 years ago

I'll release a version with debug mode soon so we can see what is happening. About the progressbar I don't know, is weird that it doesn't appear.

Girbons commented 3 years ago

there's an antibot, no issues are collected and that's why the progressbar is not rendered