tesseract-ocr / tessdoc

Tesseract documentation
https://tesseract-ocr.github.io/tessdoc/
1.84k stars 363 forks source link

What dataset is base tesseract 5 trained on? #130

Closed Waji97 closed 4 months ago

Waji97 commented 6 months ago

I'm trying hard to find any official documentation or papers for Tesseract. I couldn't find any information about the dataset that was used to train it or any of the models that are available on GitHub.

Is there any information out there?

amitdo commented 6 months ago

https://github.com/tesseract-ocr/langdata_lstm/issues/1#issuecomment-421540072