virantha pypdfocr issues

virantha / pypdfocr

Python script to do PDF OCR conversion using Tesseract

Apache License 2.0

372 stars 114 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Update pypdfocr_pdf.py

#86 ricardopimentel closed 1 year ago
0
salvo

#85 ricardopimentel closed 1 year ago
0
docs: Fix a few typos

#84 timgates42 opened 2 years ago
0
Minor formatting proposals (command options)

#83 kant opened 5 years ago
0
blPara attribute missing

#82 MikeLeo40 closed 5 years ago
2
check if version nomeclature is digit

#81 swoldetsadick closed 4 years ago
0
Fails to run on Mac OS High Sierra

#80 christmasjumper opened 6 years ago
11
Resolves ValueError for string representations of floats.

#79 dynamicwebpaige opened 6 years ago
0
Change default thread number from 4 to number of CPUs available

#78 mrpg opened 6 years ago
0
Python 3 compliant print statement

#77 ecatkins closed 6 years ago
0
Fix for PIL.Image.DecompressionBombError errors

#76 nivaca opened 6 years ago
2
Cannot install pypdfocr using pip3 : Syntaxerror in file

#75 srdg opened 6 years ago
6
I think this library does too much

#74 guettli opened 6 years ago
1
Can't install on mac

#73 hjanjua opened 6 years ago
2
Use version from packaging to compare versions

#72 Konubinix closed 4 years ago
2
Python 3 compatability

#71 benjsec opened 6 years ago
4
Feature/dropbox

#70 stefangorling opened 6 years ago
0
Python 3 compatibility (WIP)

#69 benjsec closed 6 years ago
2
Ghostscript execution fails on Windows 10

#68 r4ph43l-GitHub opened 6 years ago
1
Output of 2to3.

#67 dpnova opened 6 years ago
0
Windows: errors with imagemagick's deprecated 'identify' command

#66 phren0logy opened 7 years ago
0
Unable to convert PDF (OSX 10.13 build 17A358a)

#65 Bobspadger opened 7 years ago
1
Mixed dpi images per pdf page - configurable dpi default and/or mixed mode?

#64 clowtown opened 7 years ago
1
"brew install proppler" does not work any more?

#63 nicozhang opened 7 years ago
1
Remove existing text layer before writing final file

#62 getglad opened 7 years ago
0
Error on running pypdfocr

#61 ediwill opened 7 years ago
5
Is always generating a file with 306 bytes

#60 caitifbrito opened 7 years ago
2
PyPDF fails

#59 mauro1855 opened 7 years ago
0
Cannot find text.pdf file

#58 getglad opened 7 years ago
2
proposal for typo correction

#57 Xophe92 opened 7 years ago
0
Could not execute tesseract

#56 iiitmahesh opened 7 years ago
7
Install without evernote support

#55 oksigma closed 7 years ago
2
Fixes: WARNING: Could not execute identify to calculate DPI

#54 rasa opened 7 years ago
1
2 small spelling mistakes

#53 fliiiix opened 7 years ago
0
Specify postfix of ocr'd pdf in config

#52 mikafinja opened 7 years ago
0
update overlay_hocr_pages

#51 hazbut opened 7 years ago
0
Option to overwrite original file?

#50 ryoung81 opened 7 years ago
0
Having Problem with pypdfocr on Windows 2008 R2

#49 ahp38 closed 7 years ago
2
specifying dpi/duplicate text

#48 burrelvannjr opened 8 years ago
1
Fixed depreciated link to Tesseract

#47 8bit-pixies opened 8 years ago
0
hocr2pdf.py dropping one character (or last character from) words?

#46 zhoujianfu closed 8 years ago
1
Unable to run pypdfocr.exe ver 0.9.0

#45 djordje-m closed 7 years ago
2
When identifying original pdf file, skip the first output lines that …

#44 BrentNoorda opened 8 years ago
1
Unable to process pdfs - Windows

#43 fraserpage closed 7 years ago
8
To .txt format

#42 ghost opened 8 years ago
0
Looking for text.pdf that does not exist

#41 tadamhicks closed 7 years ago
8
Could not execute pdfimages to calculate DPI

#40 tadamhicks opened 8 years ago
2
[Request] provide different quality image files for ocr and final merging

#39 sekisushai opened 8 years ago
1
Unable to quit using Ctrl-C

#38 manastungare closed 8 years ago
1
Large multi page PDFs increase in processing time expotentially.

#37 matt12eagles closed 8 years ago
8