codeforsanjose / OpenDSJ-2018

Inform voters about 2018 San José, California and local candidates' campaign finance.
MIT License

Testing a new script capable of parsing PDF files. #76

Closed imteazs closed 5 years ago

imteazs commented 5 years ago

I was finally able to find an OCR package that can read the scanned PDF forms. Right now the code works on only one PDF, but eventually we should be able to optimize it to read through all of them and save the data into a database.
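As a rough sketch of the "read through all of them and save the data into a database" step, something like the following could work. The OCR package is not named in this issue, so the sketch takes the OCR step as a hypothetical callable `ocr_pdf` that returns one text string per page. The `ocr_pages` table, its columns, and the function names are likewise assumptions made for illustration, not the project's actual schema or code.

```python
import sqlite3
from pathlib import Path
from typing import Callable, Iterable, List


def init_db(conn: sqlite3.Connection) -> None:
    """Create the (hypothetical) table that holds one row per OCR'd page."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ocr_pages ("
        "pdf_name TEXT NOT NULL, "
        "page_number INTEGER NOT NULL, "
        "text TEXT NOT NULL, "
        "PRIMARY KEY (pdf_name, page_number))"
    )


def process_pdfs(
    pdf_paths: Iterable[str],
    ocr_pdf: Callable[[Path], List[str]],
    conn: sqlite3.Connection,
) -> int:
    """Run the supplied OCR callable on each PDF and store the page text.

    `ocr_pdf` stands in for whichever OCR package is used; it is assumed
    to take a path and return a list of page texts. Returns the number
    of PDFs processed.
    """
    init_db(conn)
    processed = 0
    for raw_path in pdf_paths:
        path = Path(raw_path)
        pages = ocr_pdf(path)
        rows = [(path.name, number, text) for number, text in enumerate(pages, start=1)]
        # INSERT OR REPLACE lets the script be re-run on the same file
        # without creating duplicate rows.
        conn.executemany("INSERT OR REPLACE INTO ocr_pages VALUES (?, ?, ?)", rows)
        processed += 1
    conn.commit()
    return processed
```

To run it over a folder, one could pass `sorted(str(p) for p in Path("forms").glob("*.pdf"))` as `pdf_paths` (the `forms` directory name is a placeholder) together with a wrapper around the real OCR call.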