openpaperwork / pyocr

A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
https://gitlab.gnome.org/World/OpenPaperwork/pyocr
930 stars 152 forks source link

Langs: can more languages be supported? #56

Closed fangfangbest closed 7 years ago

fangfangbest commented 7 years ago

i have got equ, eng and osd in the language list and if it is possible to support other languages like Chinese

jflesch commented 7 years ago

The list of languages available depends on what's installed on your system.

For example, on Ubuntu/Debian/etc, if you use Tesseract, you can install the Chinese data files with the following command: sudo apt install tesseract-ocr-chi-sim tesseract-ocr-chi-tra (chi-sim = simplified Chinese ; chi-tra = traditional Chinese).

For a complete list of supported languages: apt search tesseract-ocr- / Tesseract wiki