mikf / gallery-dl

Command-line program to download image galleries and collections from several image hosting sites
GNU General Public License v2.0

[deviantArt sta.sh] Only 120 files are downloaded from sta.sh even if more exist #1781

Open GTSipes opened 3 years ago

GTSipes commented 3 years ago

Using v1.18.3.

Looks like it has something to do with how sta.sh pages are loaded. I'm trying to download images from the following links, each of which has more than 120 files: https://sta.sh/2qgyev798ws https://sta.sh/21wx4mrkzpl3 https://sta.sh/2273m7roc8pl https://sta.sh/25kui331afg

When you load one of these pages in a browser, only 120 files are shown at first, and you have to scroll down to load the rest. When I tried to download them via gallery-dl, it only downloaded 120 files.

MarqFJA87 commented 3 years ago

I'd like to add to this by noting that gallery-dl seems unable to detect folders inside sta.sh pages, or the descriptions of said pages and their folders.

Example (NSFW): https://sta.sh/29zmrblkqdn

mikf commented 1 week ago

I just learned that it is possible to download all items in a sta.sh folder as a .zip archive by adding /zip/ to its path. This even works recursively, i.e. it includes sub-folders.

https://sta.sh/zip/2qgyev798ws https://sta.sh/zip/21wx4mrkzpl3 https://sta.sh/zip/2273m7roc8pl https://sta.sh/zip/25kui331afg

https://sta.sh/zip/29zmrblkqdn
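
For anyone converting many links, the rewrite above can be sketched in Python. This is only an illustration: `stash_zip_url` is a hypothetical helper name, and it assumes the folder URL has the plain `https://sta.sh/<id>` shape seen in this thread.

```python
from urllib.parse import urlsplit, urlunsplit


def stash_zip_url(url: str) -> str:
    """Return the /zip/ variant of a sta.sh folder URL (hypothetical helper)."""
    parts = urlsplit(url)
    path = parts.path.strip("/")
    # Leave URLs that already point at the zip endpoint untouched.
    if path.startswith("zip/"):
        return url
    # Insert "zip" as the first path segment; query and fragment are dropped.
    return urlunsplit((parts.scheme, parts.netloc, "/zip/" + path, "", ""))


if __name__ == "__main__":
    print(stash_zip_url("https://sta.sh/2qgyev798ws"))
```

The resulting URL is meant to be opened with a browser or a generic HTTP client.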

MarqFJA87 commented 6 days ago

That method unfortunately gives the following error when I try it with the link (NSFW) that I had provided as an example above.

D:\Programs\Bulk downloaders\gallery-dl>gallery-dl https://sta.sh/zip/29zmrblkqdn --verbose
[gallery-dl][debug] Version 1.27.6
[gallery-dl][debug] Python 3.10.0 - Windows-10-10.0.19045-SP0
[gallery-dl][debug] requests 2.32.2 - urllib3 2.2.1
[gallery-dl][debug] Configuration Files ['%USERPROFILE%\\gallery-dl\\config.json']
[gallery-dl][debug] Starting DownloadJob for 'https://sta.sh/zip/29zmrblkqdn'
[deviantart][debug] Using DeviantartStashExtractor for 'https://sta.sh/zip/29zmrblkqdn'
[deviantart][debug] Using custom API credentials (client-id 14925)
[urllib3.connectionpool][debug] Starting new HTTPS connection (1): www.deviantart.com:443
[urllib3.connectionpool][debug] https://www.deviantart.com:443 "GET /stash/zip HTTP/1.1" 404 None
[deviantart][info] No results for https://sta.sh/zip/29zmrblkqdn

On a related note, this particular folder has so far defied all my attempts to download it in full. Whether I use this program, JDownloader, or a direct download, the transfer always terminates prematurely or, in the case of gallery-dl, fails to start at all.

mikf commented 6 days ago

Use your browser, curl, wget, etc. for these /zip/ links, not gallery-dl.

MarqFJA87 commented 5 days ago

Already tried; the downloads get interrupted prematurely at seemingly random points.
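
One thing that might be worth trying for interrupted transfers is a client that resumes with an HTTP `Range` request instead of starting over. The sketch below uses only the Python standard library. The names `range_header` and `download_with_resume` are made up for this example, and it is an untested assumption that the sta.sh `/zip/` endpoint honours `Range` at all. If the archive is generated on the fly, the server may answer with a full `200` response, in which case the code simply restarts from byte 0.

```python
import http.client
import os
import urllib.error
import urllib.request


def range_header(offset: int) -> dict:
    """Build a Range header asking for everything from `offset` onward."""
    return {"Range": f"bytes={offset}-"} if offset > 0 else {}


def download_with_resume(url: str, dest: str, attempts: int = 5) -> bool:
    """Try to fetch `url` into `dest`, resuming after interruptions."""
    for _ in range(attempts):
        offset = os.path.getsize(dest) if os.path.exists(dest) else 0
        request = urllib.request.Request(url, headers=range_header(offset))
        try:
            with urllib.request.urlopen(request, timeout=60) as response:
                # 206 = server honoured the Range; anything else = full body.
                resumed = response.status == 206
                expected = response.headers.get("Content-Length")
                written = 0
                with open(dest, "ab" if resumed else "wb") as fh:
                    while chunk := response.read(65536):
                        fh.write(chunk)
                        written += len(chunk)
            # Without a Content-Length a short body cannot be detected,
            # so reaching end-of-stream is treated as success.
            if expected is None or written == int(expected):
                return True
        except urllib.error.HTTPError as exc:
            # 416: the requested range starts at or past the end of the
            # file, which is assumed here to mean it is already complete.
            # Any other HTTP error is treated as fatal.
            return exc.code == 416
        except (http.client.HTTPException, OSError):
            # Dropped connection or timeout: retry from the current size.
            continue
    return False
```

Usage would look like `download_with_resume("https://sta.sh/zip/29zmrblkqdn", "stash.zip")`. A partial file left over from an earlier run is picked up automatically because the offset comes from the file size on disk.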