raffaeldantas / tesseract-ocr

Automatically exported from code.google.com/p/tesseract-ocr
Other
1 stars 0 forks source link

Windows executable requires English training data. #1432

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Download latest version of portable tesseract.
2. Remove eng.traineddata and add other other traindeddata file, f.e 
ara.traineddata
3. Run tesseract input_img.png output -l ara

What is the expected output? What do you see instead?
It is expected for tesseract to run without any problems and produce output 
file because we deliberately specified the language.

But instead one gets the following message:
Error opening data file path\to\tesseract\binary\tessdata/eng.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to the parent 
directory of your "tessdata" directory.
Failed loading language 'eng'
Tesseract couldn't load any languages!
Could not initialize tesseract.

What version of the product are you using? On what operating system?
tesseract 3.02
 leptonica-1.68 (Mar 14 2011, 10:43:03) [MSC v.1500 LIB Release 32 bit]
  libgif 4.1.6 : libjpeg 8c : libpng 1.4.3 : libtiff 3.9.4 : zlib 1.2.5

Windows 7 x64

If one adds eng.traineddata back to tessdata directory, everything works as 
expected.

Original issue reported on code.google.com by mariusz....@gmail.com on 6 Mar 2015 at 6:00

GoogleCodeExporter commented 8 years ago
It is already fixed in current code. 

Original comment by zde...@gmail.com on 13 Apr 2015 at 7:58