tesseract-ocr / tesstrain

Train Tesseract LSTM with make
Apache License 2.0
620 stars 181 forks source link

[app][feat] create training data from alto or page #199

Closed M3ssman closed 3 years ago

M3ssman commented 3 years ago

Some convenient scripts to create Trainingdata from ALTO V3 or Page 2013 with corresponding Images (TIF or JPG) from the context of Newspaper-Digitalisation and Training for FID Nahost.

stale[bot] commented 3 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

bertsky commented 3 years ago

Was this superseded by #205?

kba commented 3 years ago

Was this superseded by #205?

Yes.