hardikvasa / google-images-download

Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
MIT License
8.57k stars 2.11k forks source link

How to download more images? #4

Closed LongLong-Jing closed 7 years ago

LongLong-Jing commented 7 years ago

@hardikvasa Hi Hardikvasa, thanks for your work. I have a question about downloading the images. What can I do if I want to download more images ? Specifically, What should I do if I want to download all the images under the keywork "Taj Mahal" ? Thanks for your help.

hardikvasa commented 7 years ago

Hi LongLong-Jing@ The current script downloads all the images (100) in the first page of google images search result. For downloading more than 100 images, you will have to refine your search and add more search terms in the array.

Note: Please do go through the Disclaimer notice in the README file of this repository before downloading and using the images.

sunshineatnoon commented 7 years ago

@hardikvasa Thanks for this great script. May I ask how to download more than 100 images for a single search keyword?

hardikvasa commented 7 years ago

Hi, as of now there is no 'direct' way to download more than 100 images from a single keyword since the response receives a max of 100 results. The best way to do that would be to add more keywords/key phrases and broaden the scope of search.

Thanks!

sunshineatnoon commented 7 years ago

@hardikvasa Thanks for the reply. FYI, one potential way to solve this issue is to set time limitations, so that one can crawl 1000 images in each time interval.

hardikvasa commented 7 years ago

Hi @sunshineatnoon Every time a a page is downloaded (based on a given keyword), the crawler crawls image links from that page and downloads all the images. The page does not contain more than 100 image links at once. Even if the page is downloaded again after a given time interval, it will return the same image links.

Can you briefly explain how would you achieve this setting time limitations? Thanks!

sunshineatnoon commented 7 years ago

@hardikvasa You can refer to icrawler