internetarchive / iari

Import workflows for the Wikipedia Citations Database
GNU General Public License v3.0
12 stars 9 forks source link

As a developer I want to implement catching errors from the pdf parser and test using corrupted PDFs to make the API more stable #822

Open dpriskorn opened 1 year ago

dpriskorn commented 1 year ago

Suggested by vangelis 🤩

dpriskorn commented 1 year ago

see https://github.com/ArturT/Test-PDF-Files/tree/master for pdfs to test on