Open manubider opened 5 years ago
I met the same problem.
Until there's a fix, i'd suggest trying to offset
keyword. You could even run a loop that calls this function many times and offsets by and additional 675 each time. I haven't tried this though, but i'd be interested to see if it works
@ablacklama
Unfortunately all 5000 could not be downloaded because some images were not downloadable. 814 is all we got for this search filter!
It couldn't give me a single additional photo..
I was wondering why i only got 397 photo for 500 limit on dog husky
keyword. So i checked how image search on google page behavior was. It's actually have a button that need to be clicked when it reach 397 images. Maybe this behavior have not been included on latest script, hence the crawler stopped earlier than the real image limit was.
Also, since google image itself is not an unlimited repository of images, be mindful you could reach the last image for those keyword before reaching your defined limit.
can you solve the problem??
I believe that the search simply reaches the end of the allowable scrolling range for a google/bing image search. I found this typically occurs after around 500-700 images. Since the selenium functionality of the scraper simply mimics a human using Chrome to conduct an image search manually, I don't believe a workaround is possible, other than possibly searching for slightly different search terms, and then removing duplicates in post-processing.
Hey! I'm getting a limit of 675 photos when downloading from different keyworkds, what could this be? Does google have a limit around that number? I want to download at least 10000 to train a neural network.