After running Finch on a couple of thousand images, I noticed that some sites have expiry dates on the URLs given by the Vision API, and will serve dummy/error images whilst still returning 200 OK.
The only way I can think of fixing this is to introduce a perceptual hash algorithm to check that the fetched image is correct.
After running Finch on a couple of thousand images, I noticed that some sites have expiry dates on the URLs given by the Vision API, and will serve dummy/error images whilst still returning 200 OK.
The only way I can think of fixing this is to introduce a perceptual hash algorithm to check that the fetched image is correct.