jwilk-archive / ocrodjvu

OCR for DjVu
GNU General Public License v2.0
45 stars 19 forks source link

Multiprocessing support #43

Closed FriedrichFroebel closed 2 years ago

FriedrichFroebel commented 2 years ago

I recently used ocrodjvu to add text layers to different DJVU files using Tesseract, with each file having 400-900 pages. Unfortunately, only one thread and less than one 1 GB of RAM has been used, making it quite slow and taking quite some time (example: 3 hours for nearly 900 pages), while the system load has been quite small even with my rather old hardware.

For this reason I would like to see some real multiprocessing support which speeds up the execution.

FriedrichFroebel commented 2 years ago

Nevermind, I must have overseen the -j option which seems to already support this.