Open LorenzoCianciaruso opened 5 years ago
I'm getting the same error but there doesn't seem to be any successful downloads at all.
failed to scrape URL [Errno 54] Connection reset by peer
I'm using the following command
python genre-scraper.py --genre abstract --output_dir abstract
Am having the same issue. But only when I'm running through a remote computer cluster. If I run on laptop I get 100% of the images, no throttling.
Tried removing the random.random() and it actually scraped about 500 less images.
Depending on the style/genre it sometimes doesn't download any at all. With the larger sets it usually gets somewhere 900-1500 downloads.
When running genre-scraper.py using the currently harcoded values for randomization
requests get throttled by Wikiart that returns
I think there are 2 improvements:
I'm happy to open a PR for this.