Acceleration of the launch

ocrmypdf / OCRmyPDF-EasyOCR

OCRmyPDF EasyOCR plugin

MIT License

50 stars 9 forks source link

This step is done per worker process, so each process to create its own. After creating a Reader, the object is used for each page that worker processes, so for high page count documents we get optimal throughput. It's only if you have a lot of small documents that easyocr is less optimal.

This is the earliest it can be done without having some way to persist worker process across OCR jobs which would be a major overhaul of core ocrmypdf and breaking compatibility with all other plugins, because the current multiprocessing setup is that we create workers on demand for a specific task and then dispose of them.

ocrmypdf / OCRmyPDF-EasyOCR

Acceleration of the launch #16