ultralytics / google-images-download

Google/Bing Images Web Downloader
https://ultralytics.com
GNU Affero General Public License v3.0

aborts downloading prematurely #12

Open NightMachinery opened 3 years ago

NightMachinery commented 3 years ago
python bing_scraper.py --search 'Buzz Lightyear' --download --limit 300 -o /x/ --chromedriver /usr/local/bin/chromedriver          

Searching for https://www.bing.com/images/search?q=Buzz%20Lightyear
Downloading HTML... 1376820 elements: 100%|███████████████████████████| 30/30 [00:16<00:00,  1.85it/s]
Downloading images...
1/300 https://vignette.wikia.nocookie.net/buzz-lightyear-rides/images/e/e8/Robot_Toy.jpg/revision/latest Invalid or missing image format. Skipping...
1/300 https://cdnb.artstation.com/p/assets/images/images/026/253/135/medium/eugene-napadovskiy-nos-4-a2.jpg 
2/300 https://www.gratistodo.com/wp-content/uploads/2016/10/Toy-Story-Wallpapers-6.jpg 
3/300 http://www.littlebcakes.com/wp-content/uploads/2014/01/Bumble-Bee-Cake-764x1024.jpg 
4/300 https://spongekids.com/wp-content/uploads/2014/03/costumes-for-kids/52-buzz-lightyear-kid-costume-idea.JPG Invalid or missing image format. Skipping...
4/300 https://colorearimagenes.net/wp-content/uploads/2015/11/toystory1.gif4_.jpg 
5/300 https://spongekids.com/wp-content/uploads/2014/10/super-cool-costume-ideas/11-scarecrow-costume.jpg 
6/300 http://www.lubbockonline.com/storyimage/TX/20121121/LIFESTYLE/311219834/AR/0/AR-311219834.jpg 
7/300 http://blog.holidaydiscountcentre.co.uk/wp-content/uploads/2014/10/Alice-in-Wonderland-by-Loren-Javier-via-Flickr-576x384.jpg 
8/300 https://www.littlebcakes.com/wp-content/uploads/2014/01/Kitty-Cat-Cakes-760x1024.jpg 
Unfortunately all 291 could not be downloaded because some images were not downloadable. 8 is all we got for this search filter!
Done with 2 errors in 77.0s. All images saved to /Users/evar/Base/_Code/misc/google-images-download/images
glenn-jocher commented 10 months ago

@NightMachinery the log shows that the downloader stopped after saving only 8 of the 300 requested images. Two URLs were skipped with "Invalid or missing image format", and the scraper reported that the remaining images were not downloadable.

To troubleshoot this further, check whether the URLs of the failed downloads are reachable and whether they actually serve a valid image format.
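As a rough illustration of that check, here is a minimal Python sketch. It is not part of bing_scraper.py: the helper names `failed_urls`, `sniff_image_format` and `check_url` are hypothetical. It assumes the log format shown above, where a skipped URL is followed by "Invalid or missing image format. Skipping...", and it recognises only a handful of common image signatures.

```python
import re
import urllib.request
from typing import List, Optional

# Leading byte signatures for a few common image formats (not exhaustive).
_SIGNATURES = [
    (b"\xff\xd8\xff", "jpeg"),
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    (b"BM", "bmp"),
]

# Assumes log lines of the form "<n>/<limit> <url> Invalid or missing image format. Skipping..."
_SKIPPED = re.compile(
    r"^\d+/\d+\s+(\S+)\s+Invalid or missing image format\. Skipping\.\.\.\s*$",
    re.MULTILINE,
)


def failed_urls(log_text: str) -> List[str]:
    """Return the URLs that the log marks as skipped."""
    return _SKIPPED.findall(log_text)


def sniff_image_format(data: bytes) -> Optional[str]:
    """Guess the image format from the first bytes, or return None."""
    for magic, name in _SIGNATURES:
        if data.startswith(magic):
            return name
    # WebP is a RIFF container with "WEBP" at byte offset 8.
    if data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return "webp"
    return None


def check_url(url: str, timeout: float = 10.0) -> str:
    """Fetch the first bytes of a URL and describe what was found."""
    # Some hosts reject requests without a browser-like User-Agent.
    req = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            head = resp.read(16)
    except (OSError, ValueError) as exc:
        return f"unreachable ({exc})"
    return sniff_image_format(head) or "reachable, but not a recognized image"
```

Saving the console output to a file and running `for url in failed_urls(open("run.log").read()): print(url, check_url(url))` (with `run.log` as a placeholder name) would show whether each skipped URL is unreachable, serves an HTML page instead of an image, or is in fact a valid image.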

For more detailed usage and troubleshooting, you can refer to the documentation at https://docs.ultralytics.com, or feel free to ask for further assistance.