virantha / pypdfocr

Python script to do PDF OCR conversion using Tesseract
Apache License 2.0
372 stars, 114 forks

Could not execute tesseract #56

Open iiitmahesh opened 7 years ago

iiitmahesh commented 7 years ago

How do I solve this problem? I tried both Python 2 and Python 3 on Ubuntu 16.04.

ERROR:
Could not execute tesseract Please make sure you have Tesseract installed correctly

asifali22 commented 7 years ago

I have the same issue and Tesseract is installed correctly.

virantha commented 7 years ago

Please see the advanced options in the online usage guide to specify the tesseract location.

If you can send me more details on where your tesseract binary is located, I can try to improve the automatic detection of the binary.

tranhungt commented 7 years ago

Run which tesseract to see whether it can be found. The problem may be that tesseract is not on your PATH.
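
The same lookup can be done from Python 3, which shows what a Python process itself can find on PATH. This is a minimal sketch using only the standard library; the helper name find_tesseract is hypothetical and is not part of pypdfocr.

```python
import shutil


def find_tesseract(name="tesseract"):
    """Return the full path of the executable if it is on PATH, else None."""
    # shutil.which searches the directories listed in the PATH environment
    # variable, much like the shell's `which` command.
    return shutil.which(name)


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("tesseract was not found on PATH")
    else:
        print("tesseract found at", path)
```

If this prints that tesseract was not found while the binary does exist on disk, its directory probably needs to be added to PATH.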

vvksingh24 commented 6 years ago

I was having the same issue. This can happen if you have installed pytesseract but not tesseract itself. If you are using Ubuntu, you can install it with sudo apt install tesseract-ocr.

f0rdprefect commented 5 years ago

Had this, too. It looks like the command line for the tesseract invocation needs another dash. Search for -psm and replace it with --psm.
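
The search-and-replace can be sketched in Python as follows. This is only an illustration: the function name use_double_dash_psm and the sample command line are hypothetical, and the actual line in the pypdfocr source may look different.

```python
import re

# Match "-psm" only when it is not already preceded by another dash,
# so text that already uses "--psm" is left unchanged.
_SINGLE_DASH_PSM = re.compile(r"(?<!-)-psm\b")


def use_double_dash_psm(text):
    """Replace every single-dash -psm with --psm, leaving --psm untouched."""
    return _SINGLE_DASH_PSM.sub("--psm", text)


if __name__ == "__main__":
    # Hypothetical command line, used only to demonstrate the replacement.
    print(use_double_dash_psm("tesseract in.tiff out -psm 1"))
```

The negative lookbehind makes the replacement safe to run more than once on the same text.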

pidugusundeep commented 5 years ago

Where do I search for that, @f0rdprefect, to replace it with --psm?

f0rdprefect commented 5 years ago

In pypdfocr_tesseract.py. I forked this repo and tried to package it as a .snap. I need more time to figure out how to properly expose other commands inside the snap. Anyhow, I fixed the --psm switch and pinned the requirements to reportlab version 3.4.0.

It would be good to know either how to make it run with an up-to-date reportlab or how to create a snap that works properly...