gxrxrdx / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
0 stars 0 forks source link

Whitelist not working on 3.02.02 for Python #1482

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. enter the following code at beginning of python script:

import tesseract
ocr = tesseract.TessBaseAPI()
ocr.SetVariable("tessedit_char_whitelist", "0123456789;")
ocr.Init("C:\\Program Files (x86)\\Tesseract-OCR\\","eng",tesseract.OEM_DEFAULT)

What is the expected output? What do you see instead?

It should be whitelisting those characters, but tesseract ignores it and will 
grab other characters anyway.  I've tried putting the SetVariable before and 
after Init, but that didn't change anything.

What version of the product are you using? On what operating system?
3.02.02 on Windows 8.1

Please provide any additional information below.
I do have pytesseract/python-tesseract installed as well, if that changes 
anything.

Thanks in advance for any help on this.  I've been trying to fix this problem 
all week without much progress.

Original issue reported on code.google.com by darke...@yahoo.com on 5 Jun 2015 at 4:58

GoogleCodeExporter commented 9 years ago
We do not provide support for tesseract wrapper see FAQ[1] for instructions
[1] https://code.google.com/p/tesseract-ocr/wiki/FAQ#Rules_and_advices

Original comment by zde...@gmail.com on 7 Jun 2015 at 1:58