mikf / gallery-dl

Command-line program to download image galleries and collections from several image hosting sites
GNU General Public License v2.0
11.74k stars 960 forks source link

Posts of some authors are not loading [SubscribeStar] #4309

Open jamkinson opened 1 year ago

jamkinson commented 1 year ago

I have several paid subscriptions on subscribestar and successfully download paid posts. But I recently subscribed to a new author and when I try to download posts I see '[subscribestar][info] No results for {URL_AUTHOR}'

I don't know what this is about, but I can share an account with subscriptions for testing in DM gitter.im.

Also, I can't download specific paid posts by this author by URL.

Sample post URL: https://subscribestar.adult/posts/950609

I am using the following connection string: gallery-dl --cookies-from-browser chrome/subscribestar.adult https://subscribestar.adult/posts/950609

The problem is observed only in one of the 5 paid authors

Hrxn commented 1 year ago

Wait... it's working for four out of five creators you have valid subscriptions for, but for one it's not working? That does not really make any sense.. maybe it's a site change in progress?

mikf commented 1 year ago

Maybe this is because --cookies-from-browser chrome no longer works when Chrome is running? (https://github.com/mikf/gallery-dl/issues/4195)

jamkinson commented 1 year ago

I have --cookies-from-browser chrome works fine at the same time as running chrome. If this reason really a problem, then I would not be able to get content from other authors.

jamkinson commented 1 year ago

Here I am trying to get the content of the posts. I subscribe to these authors and in 1 case I successfully get the content, but in the other case nothing happens. image

Here I'm trying to get the list-keywords for a post from a new author and I see that it doesn't even show the post id. image

jamkinson commented 1 year ago

@mikf, I managed to learn a little more from this case: When trying to download all content or a single post of this author, "redirect_path" c "{author_name}/verify_subscriber" is returned, at this step the site asks to complete the captcha.

I think this is a new way to deal with download automation. Perhaps this can be circumvented by injecting a webdriver call to enter captcha when such requests are detected.

image

biggestsonicfan commented 2 weeks ago

Finally running into this one myself. Usually I complete the captcha and --from-browser works fine, but not for this new, specific, instance I have encountered.

KanashiiPanda, as used for subscribestar extractor's tests, can access the reCAPTCHA verification here. The server header seems to read cloudflare but throws a code of 200, which I think isn't caught by current cloudflare detection. Gallery-dl returns response here and never gets to the captcha check.

EDIT: It's got to be cookie related. Exporting and using cookies.txt bypasses this, I can download just fine with it. There's some fingerprinting not being pulled in from the browser.