cdhorn / ancestry-tools

Tools for working with Ancestry.com Gedcom files and all associated media items
Apache License 2.0
37 stars 7 forks source link

Newspapers com #13

Closed asielen closed 4 years ago

asielen commented 4 years ago

Added functionality for pulling Newspapers.com pdf clippings

asielen commented 4 years ago

Created a new media type: clipping Split out the compare_file function to allow different logic by file type. (PDFs don't compare easily, the hash is not stable)

New dependency, pathvalidate

New function: get_newspaper_clipping.

New function: Check url note

Added a bit more error checking for missing files. I think this was happening when hitting ancestry with too many requests too quickly. So it will flag the file as unavailable which means it will look for the file again next time it runs.

asielen commented 4 years ago

I realize there are some changes I missed. I will resubmit.