tesseract-ocr / langdata

Source training data for Tesseract for lots of languages
Apache License 2.0

What's the difference between chi_sim.traineddata and these chi_sim.* files? #93

Closed by wamlvaw 6 years ago

wamlvaw commented 7 years ago

My tesseract-ocr is version 3.04. On the RPi, I ran the command "sudo apt-get install tesseract-ocr-chi-sim". When I tested it on a picture, it showed "Error opening data file /usr/local/share/tessdata/chi_sim.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory. Failed loading language 'chi_sim' Tesseract couldn't load any languages! " I suspect the installed language package may be for version 3.02. I found these chi_sim.* files, but I don't know how to convert them into chi_sim.traineddata. Could you please give me some suggestions? Thank you!

Shreeshrii commented 7 years ago

Please see https://github.com/tesseract-ocr/tesseract/wiki/Data-Files

You may want to upgrade your tesseract-ocr installation. https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr
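
Before upgrading, it may help to check where chi_sim.traineddata actually landed. The error message in the question says Tesseract looks for the file at <TESSDATA_PREFIX>/tessdata/chi_sim.traineddata, so the following is a minimal Python sketch that builds that path and tests a few prefixes. The helper names and the candidate directories other than /usr/local/share are assumptions for illustration, not something stated in this thread; adjust them to your system.

```python
import os


def expected_traineddata_path(tessdata_prefix, lang):
    """Build the path named in the error message:
    <TESSDATA_PREFIX>/tessdata/<lang>.traineddata
    (hypothetical helper, not part of Tesseract)."""
    return os.path.join(tessdata_prefix, "tessdata", lang + ".traineddata")


def find_prefixes(lang, candidate_prefixes):
    """Return the candidate prefixes under which the traineddata file exists."""
    return [
        prefix
        for prefix in candidate_prefixes
        if os.path.isfile(expected_traineddata_path(prefix, lang))
    ]


if __name__ == "__main__":
    # /usr/local/share comes from the error message; the other two
    # directories are guesses at where a distribution package might install.
    candidates = ["/usr/local/share", "/usr/share/tesseract-ocr", "/usr/share"]

    print("TESSDATA_PREFIX =", os.environ.get("TESSDATA_PREFIX"))
    matches = find_prefixes("chi_sim", candidates)
    if matches:
        for prefix in matches:
            print("chi_sim.traineddata found; try TESSDATA_PREFIX=" + prefix)
    else:
        print("chi_sim.traineddata not found under any candidate prefix")
```

If a prefix is reported, setting TESSDATA_PREFIX to it (the parent of the "tessdata" directory, as the error message asks) is worth trying before reinstalling.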

Shreeshrii commented 6 years ago

@zdenop This issue can be closed.