mikf / gallery-dl

Command-line program to download image galleries and collections from several image hosting sites
GNU General Public License v2.0
11.37k stars 926 forks source link

Downloading Instagram profile with 10,000+ posts. What is the optimal approach without getting banned? #2413

Open dl21g5 opened 2 years ago

dl21g5 commented 2 years ago

I've used gallery-dl flawlessly to download profiles ranging from 500-1500 posts on an account with my mobile number without a ban. Now I'm trying to download an instagram profile with 10,000+ posts, but after doing some research I've seen posts from 2020 where users mention they received a locked account/ban.

https://github.com/mikf/gallery-dl/issues/1149

https://github.com/mikf/gallery-dl/issues/1113

Has anyone done this recently and know if this is still an issue today? What would be an optimal way of downloading the entire 10k posts profile without running into these two issues?

AlttiRi commented 2 years ago

At least you can interrupt the downloading (Ctrl+C) and continue it the next day. Use -v for verbose log. You need to take the last cursor value.

Then just use additionally

-o cursor=ABCD123...qweAQWE==

to continue the downloading from the last position.

mikf commented 2 years ago

Default wait times for Instagram have increased quite a bit since https://github.com/mikf/gallery-dl/issues/1113 and https://github.com/mikf/gallery-dl/issues/1149. I think it was only 5 seconds at one point, not it is 6-12 seconds. You can increase that even more with --sleep-request or plain --sleep to be even safer. Or you try your luck with other specialized instagram downloaders such as https://github.com/arc298/instagram-scraper

afterdelight commented 1 year ago

if the downloaded posts have been saved with --download-archive, does gallery-dl make less request on the next run when scraping full profile?

mikf commented 1 year ago

No, it will still do the same amount of API requests.