kanzure / pdfparanoia

pdf watermark removal library for academic papers
https://pypi.python.org/pypi/pdfparanoia

comparediffs, a tool to download, scrub, and compare PDFs #25

Closed kim-em closed 11 years ago

kim-em commented 11 years ago

I've written a script, tests/diff/comparediff, which automates downloading each PDF from two different sources and verifying that pdfparanoia makes the two files byte-for-byte identical.

There's a README, as well as a sample list of URLs to run it on, which includes many examples where pdfparanoia currently fails.
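
For readers who haven't opened the script, the check described above can be sketched roughly as follows. This is not the code in tests/diff; it is a minimal illustration under assumptions. The helper names `fetch` and `scrubbed_copies_match` are hypothetical, and the scrubbing step is passed in as a callable rather than tied to any particular pdfparanoia call.

```python
from typing import Callable
from urllib.request import urlopen


def fetch(url: str, timeout: float = 30.0) -> bytes:
    """Download a URL and return the raw response body (hypothetical helper)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def scrubbed_copies_match(
    first: bytes,
    second: bytes,
    scrub: Callable[[bytes], bytes],
) -> bool:
    """Scrub both copies and report whether the results are byte-for-byte identical.

    `scrub` is an assumption: any function that takes the raw PDF bytes and
    returns the cleaned bytes, for example a thin wrapper around pdfparanoia.
    """
    return scrub(first) == scrub(second)
```

In use, the two byte strings would come from `fetch` called on the two source URLs for the same paper, and a `False` result would flag a watermark that the scrubber does not yet remove.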

kanzure commented 11 years ago

Thank you.