measuresforjustice/textricator
Textricator is a tool to extract text from documents and generate structured data.
https://textricator.mfj.io
GNU Affero General Public License v3.0
345 stars
38 forks
issues
Copilot tests
#63
wstumbo-mfj
opened
3 days ago
0
Does this tool support extraction of data from complex PDF structure which contains incomplete boxes?
#62
ZarvisD
opened
5 days ago
1
Foreign language support
#61
JanStarman
opened
7 months ago
1
Switch to Gradle?
#60
stephen-byrne
opened
9 months ago
1
Update itext7 to itext8
#59
stephen-byrne
closed
9 months ago
1
Parsing and debugging itext5 in the future: possible?
#58
noyannus
closed
9 months ago
6
Deprecate iText5
#57
wstumbo-mfj
opened
11 months ago
0
Add CsvTextExtractor
#56
stephen-byrne
closed
10 months ago
0
bump dependencies
#55
stephen-byrne
closed
11 months ago
0
Improve logging configuration.
#54
stephen-byrne
closed
11 months ago
0
textricator-release
#53
ryanbelair-mfj
closed
3 months ago
1
fix height calculator for --text
#52
stephen-byrne
closed
12 months ago
0
Ability to parse multiple files and write to one output
#51
stephen-byrne
closed
10 months ago
0
So this is dead, right? EDIT: No, it is not!
#50
noyannus
closed
12 months ago
3
What happened to `--debug`?
#49
noyannus
closed
12 months ago
3
[Need help or bug] Fields skipped in a state loop?
#48
noyannus
opened
1 year ago
7
[Solved] Missing decimal separator
#47
noyannus
closed
11 months ago
29
[Bug] `lry` is equal to `uly` (itext5; itext7); negative height (pdfbox)
#46
noyannus
closed
12 months ago
2
Fix deprecated methods.
#45
wstumbo-mfj
closed
1 year ago
0
Update version of expr and other jars.
#44
wstumbo-mfj
closed
1 year ago
0
`TextBoxPdfTextStripper.NON_PRINTABLE` omits 001A through 001F
#43
gmarmstrong
opened
2 years ago
0
performance improvements
#42
gmarmstrong
closed
1 year ago
0
Fix Issue #40
#41
wstumbo-mfj
closed
2 years ago
0
Command Line Input does not parse --pages correctly
#40
wstumbo-mfj
closed
2 years ago
1
JAVA_OPTS was sent twice to java engine
#39
gschaer
closed
1 year ago
0
Finalize wording.
#38
wstumbo-mfj
closed
2 years ago
0
Add support for XML output.
#37
stephen-byrne
closed
12 months ago
1
README fixes.
#36
stephen-byrne
closed
2 years ago
0
Add releases to github
#35
stephen-byrne
opened
2 years ago
0
Requires Java11+
#34
stephen-byrne
closed
2 years ago
0
Ability to parse multiple files and write to one output
#33
stephen-byrne
closed
12 months ago
0
footer ignores from top, not bottom
#32
qubodup
closed
2 years ago
1
Add `--debug` command line option and `excludeConditions` feature
#31
stephen-byrne
closed
2 years ago
0
Regex problem
#30
ozkarah
closed
2 years ago
1
Generate XML
#29
stephenbmfj
closed
10 months ago
1
Bump pdfbox from 2.0.23 to 2.0.24
#28
dependabot[bot]
closed
3 years ago
1
URL to sample pdf broken
#27
jerryhall
closed
3 years ago
1
HowTo: Create a state diagram from your YML config file
#26
eabase
opened
3 years ago
1
Please add a bin release
#25
eabase
closed
3 years ago
1
Time to update dependencies...
#24
eabase
closed
3 years ago
1
Bump bcprov-jdk15on from 1.66 to 1.67
#23
dependabot[bot]
closed
3 years ago
1
What version of Yaml is the form type parsing following/using?
#22
eabase
closed
3 years ago
1
Exception in thread "main" java.lang.NullPointerException: first.font.name must not be null
#21
eabase
closed
3 years ago
2
type and filter usage in Table type of processing is not working
#20
eabase
closed
3 years ago
13
Failing to make a simple example with exactly formatted PDF
#19
eabase
closed
3 years ago
4
WARNING: An illegal reflective access operation has occurred
#18
eabase
closed
3 years ago
5
How to get more verbose debug output?
#17
eabase
closed
3 years ago
1
How to parse sub-rows in a column?
#16
eabase
closed
3 years ago
3
Where is the PDF for the table example in README?
#15
eabase
closed
3 years ago
0
Help.
#14
Gitme76269
closed
3 years ago
7