LeoFCardoso / pdf2pdfocr

A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!
Apache License 2.0

file already has text and check text mode is enabled. Exiting. #2

Closed icegood closed 7 years ago

icegood commented 7 years ago

Hi, I got the following error while trying to add OCR to a PDF:

--> Errors/Warnings:

already has text and check text mode is enabled. Exiting.

You can find the problematic PDF on Google Drive: https://drive.google.com/open?id=0B4mLkzBXmYycQ2N5OGpneWd5dzQ

LeoFCardoso commented 7 years ago

Hi icegood. What OS are you using? Which command line flags did you use? Please check that you are not using the "-t" flag. For example: pdf2pdfocr.py -i ./file.pdf

LeoFCardoso commented 7 years ago

Closing this issue as it seems that the "-t" flag was used incorrectly.

icegood commented 7 years ago

Sorry, Leo, I had no time to respond. I don't use any flags, only -i. Have you checked the file that I shared above? It cannot be converted even without the -t flag.

LeoFCardoso commented 7 years ago

I could convert the file using just the "-i" flag. I'm reopening the issue since it's not working for you. Please provide: the OS you are using, the Python version, the complete command line used, and all messages written to stdout.

Thanks!
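The details requested above can be gathered in one step. The following is only a minimal sketch and not part of pdf2pdfocr: the function name `collect_environment` and the executable names `tesseract`, `gs`, and `magick` are assumptions (on Windows, Ghostscript in particular is often installed under a different executable name).

```python
import platform
import shutil
import sys


def collect_environment(tools=("tesseract", "gs", "magick")):
    """Return a dict describing the OS, Python version, and tool locations.

    The default tool names are assumptions; pass the names used on your system.
    """
    report = {
        "os": platform.platform(),
        "python": sys.version.split()[0],
        "command_line": " ".join(sys.argv),
    }
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        report[tool] = shutil.which(tool) or "not found on PATH"
    return report


if __name__ == "__main__":
    for key, value in collect_environment().items():
        print(f"{key}: {value}")
```

Pasting the printed lines into the issue, together with the full stdout of the failing run, covers everything asked for.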

icegood commented 7 years ago

OS: Windows 10 x64
Python version: Python 3.6.3 (v3.6.3:2c5fed8, Oct 3 2017, 17:26:49)
Other tools: tesseract-ocr-setup-3.05.01, gs9.22, ImageMagick-7.0.7-Q16 x64

Run via the pdf2pdfocr.vbs script.

Now I see on screen "Use for default [-stp -j 0.9] or <.> for last used option []." so -t was enabled by default. Please close the bug, as it is no longer valid.
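The default option string hides "-t" inside the bundled group "-stp", which is easy to miss. As an illustration only, here is a small sketch that checks whether a single-letter flag is present in such a string. It assumes getopt-style bundling of short flags and that "-j" takes a value (as "-j 0.9" suggests); `has_short_flag` is a hypothetical helper, not a pdf2pdfocr function.

```python
import shlex


def has_short_flag(option_string, flag, flags_with_value=("j",)):
    """Return True if a single-letter flag appears in a getopt-style option string.

    flags_with_value lists the flags assumed to consume an argument.
    """
    skip_next = False
    for token in shlex.split(option_string):
        if skip_next:
            # This token is the value of the previous flag, not a flag itself.
            skip_next = False
            continue
        if token == "-" or token.startswith("--") or not token.startswith("-"):
            continue
        letters = token[1:]
        for index, letter in enumerate(letters):
            if letter == flag:
                return True
            if letter in flags_with_value:
                # The value is the rest of this token, or the next token.
                if index == len(letters) - 1:
                    skip_next = True
                break
    return False


print(has_short_flag("-stp -j 0.9", "t"))    # True: "t" is bundled in "-stp"
print(has_short_flag("-i ./file.pdf", "t"))  # False
```

With the default string from the prompt, the check returns True, which matches the conclusion that check text mode was enabled without "-t" being typed explicitly.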

LeoFCardoso commented 7 years ago

Great!