ryanfb / latinocr-lat

'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata
https://ryanfb.github.io/latinocr/
Apache License 2.0
13 stars 3 forks source link

Add training against eMOP box/tiff files #2

Open ryanfb opened 9 years ago

ryanfb commented 9 years ago

https://github.com/Early-Modern-OCR/TesseractTraining/tree/master/FontTraining

Need to select an appropriate subset and remove e.g. Fraktur fonts.