ciur / papermerge

Open Source Document Management System for Digital Archives (Scanned Documents)
https://papermerge.com
Apache License 2.0
2.41k stars 257 forks

Additional languages (rus, ukr) cannot be selected in the OCR language list #612

Open deimjons opened 2 months ago

deimjons commented 2 months ago

Description

Hello. I am using a custom Docker image with the Russian and Ukrainian language packages for Tesseract installed, following the instructions at https://docs.papermerge.io/3.0/setup/add-ocr-langs/

Dockerfile:

FROM papermerge/papermerge:3.1

# add Ukrainian and Russian OCR languages
RUN apt-get update && apt-get install -y tesseract-ocr-rus tesseract-ocr-ukr
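To separate "the language packs are not installed in the image" from "the packs are installed but the application does not offer them", it can help to ask Tesseract itself what it sees. The sketch below is only an illustration, not part of Papermerge: the helper names `parse_list_langs`, `missing_langs`, and `installed_langs` are made up here, and it assumes the `tesseract` binary is on the `PATH` inside the container. It relies on the `tesseract --list-langs` command, whose header line wording differs slightly between Tesseract versions.

```python
import subprocess


def parse_list_langs(output: str) -> set[str]:
    """Extract language codes from the text printed by `tesseract --list-langs`."""
    langs = set()
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the header, which starts with
        # "List of available languages" (the rest varies by version).
        if not line or line.lower().startswith("list of available"):
            continue
        langs.add(line)
    return langs


def missing_langs(required, output: str) -> list[str]:
    """Return the required language codes that Tesseract does not report."""
    return sorted(set(required) - parse_list_langs(output))


def installed_langs() -> set[str]:
    """Run Tesseract and return the languages it reports (hypothetical helper)."""
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=True,
    )
    # Some Tesseract versions write the list to stderr, so scan both streams.
    return parse_list_langs(result.stdout + result.stderr)
```

Run inside the container, `missing_langs(["rus", "ukr"], ...)` applied to the command output should return an empty list if the packages from the Dockerfile were installed. In that case the missing entries in the OCR language list come from the application side rather than from the image.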

Info:

[Screenshot 2024-04-19 at 18 45 39]

ciur commented 2 months ago

Yes, because the "rus" and "ukr" language codes are missing in the following places:

I would gladly accept your pull request.