issues
search
clulab
/
pdf2txt
Convert PDF files to TXT
Apache License 2.0
32
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Do something about libs when this project is a dependency
#69
kwalcock
closed
9 months ago
0
Add LineWrapPreprocessor
#68
kwalcock
closed
1 year ago
0
google link to jar file dead
#67
mpetruc
closed
1 year ago
3
Conversion from 2-column PDF to "single column" output text
#66
MrUnknown789556
opened
1 year ago
1
Hyphen space needs to be handled differently
#65
kwalcock
opened
1 year ago
5
Replace nulls, update resolver
#64
kwalcock
closed
1 year ago
0
Add metadata for ScienceParse
#63
kwalcock
closed
2 years ago
0
Add settings for some local converters
#62
kwalcock
closed
2 years ago
0
Add ghostact for local OCR
#61
kwalcock
closed
2 years ago
0
Rename Textract to Amazon
#60
kwalcock
closed
2 years ago
0
Add microsoft converter
#59
kwalcock
closed
2 years ago
0
Update syntax
#58
kwalcock
closed
2 years ago
0
Combine S3 with Textract
#57
kwalcock
closed
2 years ago
0
Add extraction by Google
#56
kwalcock
closed
2 years ago
1
Evaluate quality of converters
#55
garanews
opened
2 years ago
1
Add textract converter
#54
kwalcock
closed
2 years ago
0
Add textract converter
#53
kwalcock
closed
2 years ago
0
Add word break by hyphen and ligature examples
#52
kwalcock
closed
2 years ago
0
At first just touch up, but
#51
kwalcock
closed
2 years ago
0
Fix Adobe namespace
#50
kwalcock
closed
2 years ago
0
Do something about number parameters
#49
kwalcock
closed
2 years ago
0
space-separated large numbers
#48
maxaalexeeva
closed
2 years ago
4
Experiment with PS
#47
kwalcock
closed
2 years ago
0
Update CHANGES
#46
kwalcock
closed
2 years ago
0
Don't always take word over raw
#45
kwalcock
closed
2 years ago
0
Add Adobe converter
#44
kwalcock
closed
2 years ago
0
Increase memory settings
#43
kwalcock
closed
2 years ago
0
Assimilate fancy trim of ScienceParse conversion
#42
kwalcock
closed
2 years ago
4
Test strange filenames
#41
kwalcock
closed
2 years ago
0
Check for a null Parser
#40
kwalcock
closed
2 years ago
0
Describe issues with memory
#39
kwalcock
closed
2 years ago
1
Special characters (like spaces) in filenames
#38
kwalcock
closed
2 years ago
1
fancier processing for science parse
#37
maxaalexeeva
closed
2 years ago
8
Set memory limits for different situations
#36
kwalcock
closed
2 years ago
0
Use cutoff in case restoration
#35
kwalcock
closed
2 years ago
0
Update documentation
#34
kwalcock
closed
2 years ago
0
Do not use the global, possibly implicit thread pool
#33
kwalcock
closed
2 years ago
0
Add loop parameter
#32
kwalcock
closed
2 years ago
0
Restore case
#31
kwalcock
closed
2 years ago
1
Use AppUtils.argsToMap
#30
kwalcock
closed
2 years ago
0
Kwalcock patch 1
#29
kwalcock
closed
2 years ago
0
Make better links to maven
#28
kwalcock
closed
2 years ago
0
Add badge and put links at the top
#27
kwalcock
closed
2 years ago
0
Do docs and source
#26
kwalcock
closed
2 years ago
0
Change release to maven
#25
kwalcock
closed
2 years ago
0
Add documentation
#24
kwalcock
closed
2 years ago
0
Make a reasonable CLI
#23
kwalcock
closed
2 years ago
0
Join lines that should not have been separated with a blank line
#22
kwalcock
closed
2 years ago
1
Try out scienceparse
#21
kwalcock
closed
2 years ago
3
Fix formatting issues
#20
kwalcock
closed
2 years ago
4
Next