ulb-sachsen-anhalt / ocrd-odem

OCR Workflows based on OCR-D
MIT License
3 stars 1 forks source link

Feature: Pass Language Models with different text orientation #14

Closed M3ssman closed 8 months ago

M3ssman commented 8 months ago

Description

Currently, the list of Tesseract language models which require special text orientation, is a hard-coded list in odem_commons.p:71 as RTL_LANGUAGES.

This doesn't work of course for any self-trained model which is not known forehand.

Please integrate a config option like model_rtl = ara, fas, heb and respect it's values at processing_ocrd.py#get_recognition_level , if set.

Otherwise use the common constant (see above) as default value.

einspunktnull commented 8 months ago

check it