tesseract-ocr / tesstrain

Train Tesseract LSTM with make
Apache License 2.0
626 stars 180 forks source link

Finetuning indic langauge models on word level? #397

Open Souravakb24 opened 2 weeks ago

Souravakb24 commented 2 weeks ago

If i have a dataset for which the model is not performing well. The dataset is on word level then can i train the tesseract model on the same dataset.

stweil commented 2 weeks ago

I suggest to make your dataset public as a first step.