asielen closed 4 years ago
Created a new media type: clipping.
Split out the compare_file function to allow different logic by file type. (PDFs don't compare easily because their hash is not stable.)
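To illustrate the compare_file split, here is a minimal sketch of per-type comparison. It is not the code in this PR: the helper names `compare_by_hash` and `compare_pdf` and the `COMPARATORS` table are hypothetical, and comparing PDFs by file size is only an assumed stand-in for whatever logic the PDF branch actually uses.

```python
import hashlib
from pathlib import Path


def _sha256(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_by_hash(path_a, path_b):
    """Default comparison: files match when their hashes match."""
    return _sha256(path_a) == _sha256(path_b)


def compare_pdf(path_a, path_b):
    """Assumed PDF comparison: match on file size instead of hash,
    because the bytes of a regenerated PDF are not stable."""
    return Path(path_a).stat().st_size == Path(path_b).stat().st_size


# Hypothetical dispatch table: extension -> comparison function.
COMPARATORS = {".pdf": compare_pdf}


def compare_file(path_a, path_b):
    """Pick the comparison logic from the file extension."""
    suffix = Path(path_a).suffix.lower()
    return COMPARATORS.get(suffix, compare_by_hash)(path_a, path_b)
```

Keeping the per-type logic in a lookup table means a future media type only needs one new entry rather than another branch inside compare_file.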
New dependency, pathvalidate
New function: get_newspaper_clipping.
New function: Check url note
Added a bit more error checking for missing files. I think this was happening when hitting Ancestry with too many requests too quickly. A missing file is now flagged as unavailable, which means the script will look for it again the next time it runs.
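The retry behavior can be sketched roughly as follows. This is an illustration under assumptions, not the PR's code: `fetch_media`, `pending`, the `record` dictionary layout, and the `download` callable are all hypothetical names.

```python
import os


def fetch_media(record, download):
    """Try to download one item and record the outcome.

    `download` is a hypothetical callable that takes a URL and returns
    the local path it wrote to (or raises OSError on failure).
    """
    try:
        path = download(record["url"])
    except OSError:
        path = None

    # Treat a failed request or a path with no file behind it as unavailable.
    if not path or not os.path.isfile(path):
        record["status"] = "unavailable"
        record["path"] = None
        return False

    record["status"] = "downloaded"
    record["path"] = path
    return True


def pending(records):
    """Items to attempt on this run: anything not yet downloaded,
    including items flagged unavailable on an earlier run."""
    return [r for r in records if r.get("status") != "downloaded"]
```

Because unavailable items stay in the pending list, a file that failed during a burst of requests is simply picked up again on the next run.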
I realize there are some changes I missed. I will resubmit.
Added functionality for pulling Newspapers.com PDF clippings