tesseract-ocr / langdata

Source training data for Tesseract for lots of languages
Apache License 2.0
827 stars 886 forks source link

replace with a more representative bihari text #113

Closed Shreeshrii closed 6 years ago

Shreeshrii commented 6 years ago

recently committed training_text was more of a wordlist, without any punctuation