TahmidAlam-git / facebook-marketplace-vehicle-archiver

A program to scrape cars from facebook marketplace and archive their pictures, details and descriptions onto internet archive - made originally to track Mitsubishi Evolution price history
3 stars 1 forks source link

corrupt image upload #17

Open TahmidAlam-git opened 2 months ago

TahmidAlam-git commented 2 months ago

On Facebook marketplace, there is a possibility that one of the images may never load at all, probably because it's corrupted or something is going on, on the backend of Facebook. Due to this when we grab the images, load them and archive them the corrupted image attempts to get archived too, but the internet archive python library can't handle a non-image upload (for our case) resulting in a failed archive attempt. We want to filter out this corrupt image before we archive it.