issues
search
virantha
/
pypdfocr
Python script to do PDF OCR conversion using Tesseract
Apache License 2.0
372
stars
114
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update pypdfocr_pdf.py
#86
ricardopimentel
closed
1 year ago
0
salvo
#85
ricardopimentel
closed
1 year ago
0
docs: Fix a few typos
#84
timgates42
opened
2 years ago
0
Minor formatting proposals (command options)
#83
kant
opened
5 years ago
0
blPara attribute missing
#82
MikeLeo40
closed
5 years ago
2
check if version nomeclature is digit
#81
swoldetsadick
closed
4 years ago
0
Fails to run on Mac OS High Sierra
#80
christmasjumper
opened
6 years ago
11
Resolves ValueError for string representations of floats.
#79
dynamicwebpaige
opened
6 years ago
0
Change default thread number from 4 to number of CPUs available
#78
mrpg
opened
6 years ago
0
Python 3 compliant print statement
#77
ecatkins
closed
6 years ago
0
Fix for PIL.Image.DecompressionBombError errors
#76
nivaca
opened
6 years ago
2
Cannot install pypdfocr using pip3 : Syntaxerror in file
#75
srdg
opened
6 years ago
6
I think this library does too much
#74
guettli
opened
6 years ago
1
Can't install on mac
#73
hjanjua
opened
6 years ago
2
Use version from packaging to compare versions
#72
Konubinix
closed
4 years ago
2
Python 3 compatability
#71
benjsec
opened
6 years ago
4
Feature/dropbox
#70
stefangorling
opened
6 years ago
0
Python 3 compatibility (WIP)
#69
benjsec
closed
6 years ago
2
Ghostscript execution fails on Windows 10
#68
r4ph43l-GitHub
opened
6 years ago
1
Output of 2to3.
#67
dpnova
opened
6 years ago
0
Windows: errors with imagemagick's deprecated 'identify' command
#66
phren0logy
opened
7 years ago
0
Unable to convert PDF (OSX 10.13 build 17A358a)
#65
Bobspadger
opened
7 years ago
1
Mixed dpi images per pdf page - configurable dpi default and/or mixed mode?
#64
clowtown
opened
7 years ago
1
"brew install proppler" does not work any more?
#63
nicozhang
opened
7 years ago
1
Remove existing text layer before writing final file
#62
getglad
opened
7 years ago
0
Error on running pypdfocr
#61
ediwill
opened
7 years ago
5
Is always generating a file with 306 bytes
#60
caitifbrito
opened
7 years ago
2
PyPDF fails
#59
mauro1855
opened
7 years ago
0
Cannot find text.pdf file
#58
getglad
opened
7 years ago
2
proposal for typo correction
#57
Xophe92
opened
7 years ago
0
Could not execute tesseract
#56
iiitmahesh
opened
7 years ago
7
Install without evernote support
#55
oksigma
closed
7 years ago
2
Fixes: WARNING: Could not execute identify to calculate DPI
#54
rasa
opened
7 years ago
1
2 small spelling mistakes
#53
fliiiix
opened
7 years ago
0
Specify postfix of ocr'd pdf in config
#52
mikafinja
opened
7 years ago
0
update overlay_hocr_pages
#51
hazbut
opened
7 years ago
0
Option to overwrite original file?
#50
ryoung81
opened
7 years ago
0
Having Problem with pypdfocr on Windows 2008 R2
#49
ahp38
closed
7 years ago
2
specifying dpi/duplicate text
#48
burrelvannjr
opened
8 years ago
1
Fixed depreciated link to Tesseract
#47
8bit-pixies
opened
8 years ago
0
hocr2pdf.py dropping one character (or last character from) words?
#46
zhoujianfu
closed
8 years ago
1
Unable to run pypdfocr.exe ver 0.9.0
#45
djordje-m
closed
7 years ago
2
When identifying original pdf file, skip the first output lines that …
#44
BrentNoorda
opened
8 years ago
1
Unable to process pdfs - Windows
#43
fraserpage
closed
7 years ago
8
To .txt format
#42
ghost
opened
8 years ago
0
Looking for text.pdf that does not exist
#41
tadamhicks
closed
7 years ago
8
Could not execute pdfimages to calculate DPI
#40
tadamhicks
opened
8 years ago
2
[Request] provide different quality image files for ocr and final merging
#39
sekisushai
opened
8 years ago
1
Unable to quit using Ctrl-C
#38
manastungare
closed
8 years ago
1
Large multi page PDFs increase in processing time expotentially.
#37
matt12eagles
closed
8 years ago
8
Next