ocropus-archive / DUP-ocropy

Python-based tools for document analysis and OCR
Apache License 2.0

Multithreaded Training #302

Closed Programmer888 closed 5 years ago

Programmer888 commented 6 years ago

Is it possible to run ocropy training multithreaded, like the page segmentation and the preprocessing? I currently train my models using the command: ocropus-rtrain -c train/*/*.gt.txt test/*/*.gt.txt -o models/model train/*/*.bin.png

amitdo commented 6 years ago

> Is it possible to run ocropy training multithreaded

No, it is not possible.

ChillarAnand commented 6 years ago

Duplicate of https://github.com/tmbdev/ocropy/issues/166

zuphilip commented 6 years ago

I just added a new FAQ for this: https://github.com/tmbdev/ocropy/wiki/FAQ#how-can-the-training-been-speed-up-is-it-possible-to-parallelize-the-trainig-or-to-use-gpu-power-for-training

ghost commented 6 years ago

@zuphilip have you tested ocropy2?

zuphilip commented 6 years ago

No, I haven't tested ocropy2 yet. The best chance to hear more about it is to go to Vienna next week, where @tmbdev is giving a workshop. Maybe some material will also be shared online afterwards?

ghost commented 6 years ago

I hope that @tmbdev shares the workshop slides later on :sunglasses: