openpaperwork / pyocr

A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
https://gitlab.gnome.org/World/OpenPaperwork/pyocr

[Enhancement]: enable parallel calls to the tesseract CLI #83

Closed a-pagano closed 6 years ago

a-pagano commented 6 years ago

Setting the OMP_THREAD_LIMIT=1 environment variable is the only way to make calls to tesseract play nicely with multiprocessing.

For more info see: https://github.com/tesseract-ocr/tesseract/issues/898
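To illustrate the idea, here is a minimal sketch of calling the tesseract CLI from a multiprocessing pool with OMP_THREAD_LIMIT=1 set for each child process. It calls the `tesseract` binary directly through `subprocess` and does not reflect how pyocr itself invokes it; the helper names `single_thread_env`, `ocr_file` and `ocr_many` are hypothetical, and it assumes `tesseract` is on the PATH.

```python
import os
import subprocess
from multiprocessing import Pool


def single_thread_env(base_env=None):
    """Return a copy of the environment with OpenMP limited to one thread."""
    env = dict(os.environ if base_env is None else base_env)
    # Each tesseract process then uses a single OpenMP thread, so that
    # parallelism comes from the worker pool rather than from OpenMP.
    env["OMP_THREAD_LIMIT"] = "1"
    return env


def ocr_file(image_path):
    """Run the tesseract CLI on one image and return the recognized text."""
    result = subprocess.run(
        ["tesseract", image_path, "stdout"],  # "stdout" prints the text
        env=single_thread_env(),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def ocr_many(image_paths, workers=4):
    """OCR several images in parallel, one tesseract process per task."""
    with Pool(processes=workers) as pool:
        return pool.map(ocr_file, image_paths)
```

Calling `ocr_many(["page1.png", "page2.png"])` from under an `if __name__ == "__main__":` guard would run one tesseract process per image; the file names here are placeholders.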

a-pagano commented 6 years ago

Sorry for the noise, I mistakenly created a PR against your master instead of my fork's master. I'll submit my PR again once I have cleaned up and properly tested the changes.

jflesch commented 6 years ago

no prob