SuffolkLITLab / FormFyxer

A tool for learning about and pre-processing forms
MIT License

Not all fields are equal: capture the field length (bounding box) so we can detect very large text boxes and treat them differently from short-answer text fields #71

Closed nonprofittechy closed 1 year ago

nonprofittechy commented 2 years ago

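We can assume about 6 pixels of width per character and about 12 pixels of height per line of text at a typical PDF font size. Dividing a field's bounding-box width and height by those numbers gives a rough estimate of how many characters and lines it can hold.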

Compare to assumptions we're making here: https://github.com/SuffolkLITLab/docassemble-ALWeaver/blob/2ff687355240b0d93ecede8c5ba0ae40e145e5eb/docassemble/ALWeaver/interview_generator.py#L140
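
A rough sketch of how that estimate could work, using the 6 px / 12 px figures above. Everything named here (`FieldBox`, `estimate_capacity`, `classify_field`, and the two-line cutoff for a "large" box) is hypothetical and only for illustration. It is not existing FormFyxer or ALWeaver code.

```python
from dataclasses import dataclass

# Heuristics from this issue: ~6 px per character, ~12 px per line of text.
CHAR_WIDTH_PX = 6.0
LINE_HEIGHT_PX = 12.0


@dataclass
class FieldBox:
    """Hypothetical bounding box for a PDF field: (x0, y0) to (x1, y1)."""
    x0: float
    y0: float
    x1: float
    y1: float

    @property
    def width(self) -> float:
        return abs(self.x1 - self.x0)

    @property
    def height(self) -> float:
        return abs(self.y1 - self.y0)


def estimate_capacity(box: FieldBox) -> int:
    """Estimate how many characters fit in the box (at least 1 per axis)."""
    chars_per_line = max(1, int(box.width // CHAR_WIDTH_PX))
    lines = max(1, int(box.height // LINE_HEIGHT_PX))
    return chars_per_line * lines


def classify_field(box: FieldBox, min_lines_for_area: int = 2) -> str:
    """Return "area" for boxes tall enough for several lines, else "text".

    The two-line threshold is an assumption, not something decided in this issue.
    """
    lines = int(box.height // LINE_HEIGHT_PX)
    return "area" if lines >= min_lines_for_area else "text"


short_answer = FieldBox(0, 0, 120, 12)
big_box = FieldBox(0, 0, 300, 60)
print(estimate_capacity(short_answer), classify_field(short_answer))
print(estimate_capacity(big_box), classify_field(big_box))
```

The thresholds would probably need tuning against real forms, since font size and field padding vary from one PDF to the next.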