issues
search
jalan
/
pdftotext
Simple PDF text extraction
MIT License
870
stars
99
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
wrong text content with column inside page
#127
fukemy
opened
3 weeks ago
3
Empty lists for reactions_from_text_in pdf and OSError: [Errno 22] Invalid argument: 'C:\\Users\\Lyubomir/.torch/iopath_cache\\s/gxy11xkkiwnpgog\\publaynet-tf_efficientdet_d1.pth.tar?dl=1.lock' for reaction_from_figures_in_pdf
#124
LyuboKotop
opened
2 months ago
1
poppler/error: Failed to parse XRef entry [11].poppler/error: Top-level pages object is wrong type (null)
#123
juanfrilla
closed
3 months ago
1
Unable to install
#122
sxaxmz
closed
4 months ago
2
Not exactly an issue
#121
sheikgit
closed
7 months ago
1
#17 in arch linux
#120
dvtate
closed
10 months ago
9
Can't make crop work
#119
iconoclasthero
closed
11 months ago
1
Getting error Invalid ToUnicode Cmap
#118
de3
closed
11 months ago
2
I am getting this issue in python 3.7.7 macosm2
#117
ram8545
opened
1 year ago
0
Poppler/error seen while extracting text from PDF such as poppler/error (572194): Unknown filter 'JPXDecode'\n
#116
AsfarHorani
closed
1 year ago
2
PDF tags after converting tags from PDF
#115
Wowhere
closed
1 year ago
5
double column pdf
#114
malinphy
closed
1 year ago
2
Can't install using conda/mamba
#113
mdaeron
closed
1 year ago
4
not able to install in red-hat base image 8
#112
tiger-tribe-sunnyraj
closed
1 year ago
1
Provide access to page::text_list
#111
stefan6419846
opened
1 year ago
1
Formatting changed after new install
#110
tbrownhe
closed
1 year ago
4
Add a poppler_version function
#109
jalan
opened
1 year ago
0
Enable tests requiring at least version 0.88 if requirement is met
#108
stefan6419846
opened
1 year ago
3
Import error when running on MacOs (M1)
#107
Ethansev
closed
1 month ago
1
ImportError: DLL load failed while importing pdftotext: The specified module could not be found
#106
razbengera
closed
2 years ago
0
AttributeError: module 'pdftotext' has no attribute 'PDF'
#105
vignesh-bosch
closed
2 years ago
4
problems reading and maintaining the layout
#104
hian18
closed
2 years ago
2
Crash when PDF contains empty pages
#103
YasminaFr
closed
2 years ago
3
Unable to install pdftotext : poppler/cpp/poppler-document.h not found
#102
yashali
closed
2 years ago
4
Two different text output is returned
#101
pavankalyan066
closed
2 years ago
2
Symbol not found in flat namespace
#100
caseydm
closed
2 years ago
10
question about how to approach bonding box problem
#99
klebs6-x
opened
2 years ago
1
Can't import pdftotext in my Mac Apple Silicon M1
#98
anprieto
closed
2 years ago
7
Add option to hide clipped text and ignore diagonal text
#97
ReMiOS
closed
2 years ago
1
setup.py: handle missing brew command
#96
smancill
closed
2 years ago
3
macOS: `setup.py` fails when `brew` is not in PATH
#95
smancill
closed
2 years ago
1
I tried all the steps in the above column still no luck.
#94
ChefQ
closed
3 years ago
1
Extract text from scanned pdfs
#93
mapto
closed
1 year ago
3
Fails to install on Windows with Python 3.9
#92
TheQuinbox
closed
3 years ago
1
Red hat Linux env - unable to execute 'gcc': No such file or directory
#91
prasadgsk
closed
3 years ago
4
Can't install this on macOS Big Sur 11.6
#90
anshul-klaarhq
closed
3 years ago
3
raw=False argument not working in latest version
#89
surazgyawali
closed
3 years ago
2
After upgrade to 2.2.0 all strings are treated as separate lines
#88
sprnza
closed
3 years ago
2
Unicode Character 'ZERO WIDTH NON-JOINER' (U+200C)
#87
hosseinDev1
closed
3 years ago
1
Add brew support
#86
8W9aG
closed
3 years ago
4
Documentation and Header Text
#85
Tonystarq
closed
3 years ago
1
Pip install doesn't work
#84
Salz0
closed
3 years ago
1
Allow use of all three layout options
#83
jalan
closed
3 years ago
7
Cannot run in Apple M1 architecture
#82
LittleJymn
closed
3 years ago
3
This project should be released with GPL license
#81
elonzh
closed
3 years ago
4
Install on MacOS C++11 Error
#80
willmac321
closed
3 years ago
5
[CI] Build wheels for macOS, Linux and Windows
#79
bauerj
closed
2 years ago
9
Intercept poppler errors from appearing in stderr
#77
mrooding
closed
3 years ago
6
Left margin is cut off
#76
paulserafini
closed
3 years ago
3
Is it possible to find text coordinates on the page using pdftotext?
#75
jeanmonet
closed
4 years ago
4
Next