issues
search
SuffolkLITLab
/
FormFyxer
A tool for learning about and pre-processing forms
MIT License
11
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Validate the measures in here against the larger PDF dataset and with experts
#89
nonprofittechy
opened
1 year ago
0
Complexity score doesn't factor in the estimated length of the answer
#88
nonprofittechy
closed
1 year ago
0
"third-party" answers aren't part of the complexity score right now
#87
nonprofittechy
closed
1 year ago
0
Measure font size
#86
nonprofittechy
opened
1 year ago
2
Directly measure whitespace
#85
nonprofittechy
opened
1 year ago
2
Use currency symbol as a complexity marker?
#84
nonprofittechy
opened
1 year ago
1
Add more actionable information to stats
#83
nonprofittechy
closed
1 year ago
0
GPT3 checking and stability
#82
BryceStevenWilley
closed
1 year ago
1
Citations don't have useful information
#81
BryceStevenWilley
closed
1 year ago
1
Assorted Form Complexity Finishes
#80
BryceStevenWilley
closed
1 year ago
0
get_existing_pdf_fields_with_context
#79
BryceStevenWilley
opened
1 year ago
0
add ignore types for transformers and openai, reformat with black
#78
nonprofittechy
closed
1 year ago
1
Make Spot token a parameter
#77
nonprofittechy
closed
1 year ago
0
Can we use heuristics to guess a form's title?
#76
nonprofittechy
closed
1 year ago
2
Rely on numpy more
#75
BryceStevenWilley
closed
2 years ago
1
Fix types
#74
BryceStevenWilley
closed
2 years ago
0
Detect amount of words that are written in all capital letters
#73
nonprofittechy
closed
1 year ago
1
Separate out fields by type
#72
nonprofittechy
closed
2 years ago
1
Not all fields are equal: capture the field length (bounding box) so we can detect very large text boxes and treat differently from short answer text fields
#71
nonprofittechy
closed
2 years ago
0
Add sentence count, passive voice and citations
#70
nonprofittechy
closed
2 years ago
2
Reformat with Black, start adding types
#69
nonprofittechy
closed
2 years ago
0
Incompatible types in pdf_wrangling.py
#68
nonprofittechy
closed
2 years ago
0
Should be able to give most of the stats even if the PDF doesn't have any form elements yet
#67
nonprofittechy
closed
2 years ago
0
What is the right way to measure readability - where do we segment text?
#66
nonprofittechy
closed
1 year ago
2
We need to preserve some special characters, like the section symbol, parentheses, etc, or do citation checking at an earlier step in the pipeline
#65
nonprofittechy
closed
2 years ago
0
Use PikePDF and pdfminer.six
#64
nonprofittechy
closed
2 years ago
1
Track down big errors with readability statistics
#63
nonprofittechy
closed
2 years ago
0
Add passive voice detection
#62
nonprofittechy
closed
2 years ago
0
Identify and capture the form "type"
#61
nonprofittechy
opened
2 years ago
0
Feature to analyze vocabulary?
#60
nonprofittechy
closed
2 years ago
1
Feature to find similar sentences that aren't identical?
#59
nonprofittechy
opened
2 years ago
0
Make text fields in table cells
#58
BryceStevenWilley
opened
2 years ago
1
Better brackets
#57
BryceStevenWilley
closed
2 years ago
0
Create an easy API for all of the values that go into the complexity score
#56
nonprofittechy
closed
1 year ago
0
Improve field finding in PDFs:
#55
BryceStevenWilley
closed
2 years ago
1
Make list_codes column valid JSON
#54
nonprofittechy
opened
2 years ago
0
Field normalizer shouldn't use reserved Docassemble or Python keywords for variable names
#52
nonprofittechy
opened
2 years ago
0
Add a function to replace radio buttons with a pair/list of checkboxes
#51
nonprofittechy
opened
2 years ago
0
Add a function to replace drop-down menus with text fields of the same size
#50
nonprofittechy
opened
2 years ago
0
Add a function to normalize font size and checkbox style
#49
nonprofittechy
opened
2 years ago
0
Make the add_pdf_fields() function work on image files?
#48
nonprofittechy
opened
2 years ago
1
Allow overwriting input file when recognizing form fields
#47
nonprofittechy
closed
2 years ago
1
Merge lines that are next to each other with no text into one big field
#46
nonprofittechy
opened
2 years ago
2
small fixes and cleanups in lit_explorer
#45
BryceStevenWilley
closed
2 years ago
2
Spacy on install
#44
BryceStevenWilley
closed
2 years ago
0
Problem using FormFyxer on a new machine still
#33
nonprofittechy
closed
2 years ago
11
Should `swap_pdf_page` be renamed?
#32
nonprofittechy
closed
1 year ago
0
Form field recognizer creates overlapping fields and doesn't recognize all table cells or round radio buttons
#31
nonprofittechy
opened
2 years ago
2
Form field recognizer should break up long lines if there is text in the middle of the line
#30
nonprofittechy
opened
2 years ago
4
Adjust API of swap_pdf to make a general-purpose utility to copy fields
#29
nonprofittechy
closed
2 years ago
1
Previous
Next