mittagessen / kraken

OCR engine for all the languages
http://kraken.re
Apache License 2.0
729 stars 130 forks source link

Parallelization in ketos test #455

Closed colibrisson closed 10 months ago

colibrisson commented 1 year ago

ketos test has a --workers option but parallelization only happens at the data loader level. I guess it's because the tests used to be performed on GPU but it's not the case anymore. I was wondering if there was a good reason now for not using a process pool.

mittagessen commented 1 year ago

The middle-term plan is to factor out the testing into the proper pytorch-lightning routines which would make process-level parallelization somewhat difficult. That basically is the main reason.