openpaperwork / pyocr

A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
https://gitlab.gnome.org/World/OpenPaperwork/pyocr

Adding languages #63

Closed: singhketan closed this issue 7 years ago

singhketan commented 7 years ago

I'm new to pyocr and to OCR in general. I'm trying to use pyocr with languages such as French and Chinese, but get_available_languages returns only three options: osd, eng, equ. How can I add other languages?

jflesch commented 7 years ago

On Debian or Ubuntu: apt install tesseract-ocr-fra tesseract-ocr-chi-sim

jflesch commented 7 years ago

Basically, you (usually) install language data files for Tesseract the same way you installed Tesseract itself, for example through your distribution's package manager.
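
To confirm that the new language packs were picked up, something like the following may help. It is only a minimal sketch: `missing_languages` and `tesseract_languages` are hypothetical helper names made up for this example, and it assumes the first tool pyocr reports is the one you want. The codes `fra` and `chi_sim` are the Tesseract language codes that correspond to the two packages mentioned above.

```python
def missing_languages(available, wanted):
    """Return the wanted language codes absent from `available`, in order.

    Hypothetical helper for this example; not part of pyocr.
    """
    have = set(available)
    return [code for code in wanted if code not in have]


def tesseract_languages():
    """Return the language codes reported by the first OCR tool pyocr finds.

    Returns None if pyocr is not importable or no OCR tool is detected.
    """
    try:
        import pyocr
    except ImportError:
        return None
    tools = pyocr.get_available_tools()
    if not tools:
        return None
    # Assumption: the first detected tool is the one of interest.
    return tools[0].get_available_languages()


if __name__ == "__main__":
    wanted = ["fra", "chi_sim"]
    langs = tesseract_languages()
    if langs is None:
        print("pyocr or an OCR tool does not appear to be installed")
    else:
        print("available:", langs)
        print("still missing:", missing_languages(langs, wanted))
```

If a code is still reported as missing after installing its package, the data file is probably in a directory that Tesseract is not searching.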