issues
search
jlsutherland
/
doc2text
Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.
MIT License
1.27k
stars
97
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Image not cropped accurately
#35
tekurkaa
opened
1 year ago
0
No module name PythonMagick
#34
atul219
opened
3 years ago
2
Added command for pip3 installation with python 3
#33
Quint-Anir
closed
7 months ago
0
Does is support stream data ?
#32
multinucliated
opened
4 years ago
0
Maybe a stupid question about the api, can't find in source code
#31
LongxingTan
closed
4 years ago
0
AttributeError: 'Page' object has no attribute 'image' ISSUE
#30
angelo337
closed
6 years ago
1
FileNotFoundError
#29
jashuRc
opened
6 years ago
0
ModuleNotFoundError: No module named 'PyPDF2'
#28
alexauvray
opened
6 years ago
1
Can not install pythonmagick.
#27
dyllanwli
closed
7 years ago
2
Eror on pip install PythonMagick
#26
liber145
closed
7 years ago
2
Python 3 compatibility fix
#25
andjelx
opened
7 years ago
1
Python 3.5 compatibility
#24
andjelx
opened
7 years ago
6
text extraction from png files does not seem to work
#23
vsriram28
opened
7 years ago
0
Unable to process
#22
alonecoder1337
opened
7 years ago
0
Question: Support for Windows
#21
modulexcite
opened
7 years ago
0
Merge pull request #1 from jlsutherland/master
#20
avi-levy
closed
8 years ago
0
Error passing the lang to the class
#19
crgimenes
closed
8 years ago
1
Add supports for lang parameter
#18
rcatajar
closed
8 years ago
2
PEP8 and python3 support
#17
rcatajar
closed
8 years ago
1
Compile opencv in /tmp
#16
rcatajar
closed
8 years ago
1
Support for non scanned documents (.doc, .docx, regular pdf)
#15
rcatajar
opened
8 years ago
4
Error on doc.process()
#14
rsteca
opened
8 years ago
2
Get an homogeneous background for better thresholding results
#13
remi-pr
opened
8 years ago
3
Use nproc(1) to determine number of make jobs
#12
jwilk
closed
8 years ago
3
What is wrong with this ? Can someone please explain ?
#11
iamvc7
opened
8 years ago
1
it'd be nice if this could produce text-overlaid PDFs
#10
jbothma
opened
8 years ago
7
Does not work on python3
#9
lervag
closed
8 years ago
2
support for image/jpeg
#8
belwase
closed
8 years ago
1
Can't pip install this
#7
Unrepentant-Atheist
closed
8 years ago
1
Fixed issue with wrong number of variables in function return
#6
achikin
closed
8 years ago
6
issue with extract_text
#5
rsteca
closed
8 years ago
1
Fixed a typo in variable name
#4
achikin
closed
8 years ago
1
Fixes 'Document instance has no attribute 'file_basename''
#3
achikin
closed
8 years ago
1
AttributeError: Document instance has no attribute 'file_basename'
#2
jwilk
closed
8 years ago
1
README: fix duplicate word
#1
jwilk
closed
8 years ago
1