SuffolkLITLab / FormFyxer

A tool for learning about and pre-processing forms
MIT License

Improve field finding in PDFs: #55

Closed BryceStevenWilley closed 2 years ago

BryceStevenWilley commented 2 years ago

Compare

This comparison is admittedly cherry-picked, but that's because field finding was already pretty good before, and for many forms not much changed except the font size. We were already finding checkboxes, but I tweaked some params there to find more of them, because many were too small to be picked up.

Before

Screenshot from 2022-08-19 17-45-34

After

Screenshot from 2022-08-19 17-41-41
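
To illustrate the kind of parameter tweak described above (loosening size limits so smaller checkboxes are kept), here is a minimal sketch. It is not FormFyxer's actual code: the `Box` type, `filter_checkboxes`, and the `min_side`, `max_side`, and `max_ratio_skew` parameters are all hypothetical names chosen for the example, and the pixel values are made up.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Hypothetical candidate rectangle found on a page image (pixels)."""
    x: int
    y: int
    w: int
    h: int


def filter_checkboxes(boxes, min_side, max_side, max_ratio_skew=0.2):
    """Keep roughly square boxes whose sides fall within [min_side, max_side]."""
    kept = []
    for b in boxes:
        if b.h <= 0 or b.w <= 0:
            continue  # skip degenerate rectangles
        # A checkbox should be close to square: width/height near 1.0.
        is_square = abs(b.w / b.h - 1.0) <= max_ratio_skew
        in_range = min_side <= b.w <= max_side and min_side <= b.h <= max_side
        if is_square and in_range:
            kept.append(b)
    return kept


# Made-up candidates: a small checkbox, a normal checkbox, and a text field.
candidates = [Box(10, 10, 8, 8), Box(40, 10, 14, 15), Box(80, 10, 120, 14)]

# A stricter minimum size misses the small checkbox.
strict = filter_checkboxes(candidates, min_side=12, max_side=40)

# Lowering the minimum size keeps it, while the wide text field is still rejected.
relaxed = filter_checkboxes(candidates, min_side=6, max_side=40)
```

The trade-off with a lower minimum size is that small non-checkbox marks become more likely to pass the filter, which is one reason to re-run the detector over a broader set of PDFs after a change like this.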

BryceStevenWilley commented 2 years ago

~Draft until I patch the bracket checkbox finder (in https://github.com/SuffolkLITLab/FormFyxer/tree/better_brackets, will likely remove all of the special boxdetect code).~ Ready for review now! I'm currently running this on a set of PDFs to make sure it doesn't do worse on any of them, but I'm trying not to get too held back by the results.