ruediger / VobSub2SRT

Converts VobSub subtitles (.idx/.srt format) into .srt subtitles.
GNU General Public License v3.0
293 stars 65 forks source link

idx language code "zh" gets expanded to "zho", not recognized by tesseract #78

Open Seegras opened 4 years ago

Seegras commented 4 years ago

I actually opened an issue there too: https://github.com/tesseract-ocr/tesseract/issues/2995

$ grep id: subs-chi.idx id: zh, index: 0 $ vobsub2srt subs-chi Error opening data file /usr/share/tesseract-ocr/4.00/tessdata/zho.traineddata Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory. Failed loading language 'zho' Tesseract couldn't load any languages! Failed to initialize tesseract (OCR).