issues
search
virantha
/
pypdfocr
Python script to do PDF OCR conversion using Tesseract
Apache License 2.0
372
stars
114
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Port pypdfocr to Python 3
#36
rmoretti
opened
8 years ago
1
Added in more options for watcher
#35
albertcbrown
opened
9 years ago
0
Using numeric keywords in config file triggers error
#34
marklagace
closed
8 years ago
1
How to improve the accuracy?
#33
MASantos
closed
8 years ago
1
Can't get started
#32
hjanjua
closed
9 years ago
1
Allow killing the process via Ctrl+C
#31
zaroth
closed
8 years ago
2
using pypdfocr from within a python program
#30
michaelsorich
closed
8 years ago
3
Make watchdog and evernote optional dependencies
#29
chmduquesne
closed
9 years ago
1
Could not understand output of pdfimages
#28
kundor
opened
9 years ago
4
Wrong y position
#27
Wikunia
closed
9 years ago
5
File name with escapable HTML character
#26
trybik
closed
8 years ago
1
I/O Error: Couldn't open file '-list'
#25
OUsteventhomas
closed
8 years ago
1
pypdfocr.exe is broken because of multiprocessing
#24
virantha
closed
8 years ago
7
Update pypdfocr.py - added option to turn off preprocessing step.
#23
ChristosT
closed
9 years ago
0
More options maybe...
#22
ChristosT
closed
9 years ago
3
"Too many open files" error
#21
coreyp
closed
9 years ago
4
problem with invoking pdfimages?
#20
rotheda
closed
9 years ago
1
Windows - point not allowed in filename
#19
toninlg
closed
10 years ago
5
Temporary fix for parser looking for html instead of hocr
#18
slaiyer
closed
10 years ago
2
Windows v 0.7.4 - text alignment
#17
toninlg
closed
10 years ago
7
include quickinstall script for PIL in case for Linux when "pip install pil" does not resolve
#16
hyperfl0w
closed
10 years ago
2
Dependancy has wrong link
#15
hyperfl0w
closed
10 years ago
1
AttributeError: 'unicode' object has no attribute 'seek'
#14
tringger
closed
10 years ago
5
Added -l option to set OCR language in tesseract and changed PIL to pill...
#13
gesellkammer
closed
10 years ago
1
Use pillow instead of PIL
#12
gesellkammer
closed
10 years ago
2
configurable language
#11
gesellkammer
closed
10 years ago
2
Pdf file size
#10
matteocrippa
closed
10 years ago
6
pypdfocr error [ValueError: invalid literal for int() with base 10: ''] in '\pypdfocr_pdf", line 98, in overlay_hocr'
#9
hsh001
closed
10 years ago
5
xml parse error on converting a non-searchable pdf to searchable pdf
#8
rajatdutta
closed
10 years ago
11
Pages in output PDF not guaranteed to be in correct order on every platform
#7
pjh1974
closed
10 years ago
3
Feature Request -- use file name to help find folders
#6
dlang123
closed
10 years ago
2
GhostScript on Windows
#5
dlang123
closed
10 years ago
4
Children with nested tags not overlaid.
#4
vigneshwerv
closed
10 years ago
6
Case sensitivity in filing module
#3
ghost
closed
10 years ago
1
Needs current version of tesseract
#2
ghost
closed
10 years ago
4
Update setup.py
#1
ghost
closed
11 years ago
0
Previous