issues
search
galkahana
/
pdf-text-extraction
cli for extracting text from PDF files (and maybe possibly tables)
Apache License 2.0
74
stars
19
forks
source link
Word tables
#11
Closed
galkahana
closed
1 year ago
galkahana
commented
1 year ago
support ms word tables.
while at it:
improved spacing algorithm by correctly reading the font width for space
more safety into internal table parsing
support ms word tables.
while at it: