Closed matthew77777 closed 1 year ago
Hi @matthew77777. Have you solved the issue?
Your model works well on the training data (the accuracy is 1.0) but performs extremely badly on the validation data. It seems that your model overfits the training set; correct me if I'm wrong. I'm wondering how much data we need to get a good model :)
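To make the overfitting diagnosis above concrete, here is a minimal sketch that compares training and validation accuracy. The function names, the 0.2 threshold, and the validation value of 0.05 are illustrative assumptions, not values from this issue or from any library.

```python
def generalization_gap(train_acc: float, val_acc: float) -> float:
    """Return how much better the model does on training data than on validation data."""
    return train_acc - val_acc


def looks_overfit(train_acc: float, val_acc: float, max_gap: float = 0.2) -> bool:
    """Flag a run as likely overfit when the gap exceeds max_gap (an arbitrary, assumed threshold)."""
    return generalization_gap(train_acc, val_acc) > max_gap


# Training accuracy 1.0 is from the thread; the validation accuracy 0.05 is a made-up example.
print(looks_overfit(1.0, 0.05))
```

A large gap like this usually points to too little or too uniform training data rather than to a bug in the training command.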
Anyway, @matthew77777, because you use a custom dictionary, I think you need to set character_type: ch instead of EN, as explained here.
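When a custom dictionary is involved, it may also be worth checking that every character in the training labels actually appears in it. The sketch below assumes a dictionary file with one character per line and label lines of the form `image_path<TAB>text`; both formats, the helper names, and the sample data are assumptions for illustration and should be adapted to your own files.

```python
def load_dictionary(lines):
    """Build the set of known characters, assuming one character per line."""
    return {line.rstrip("\n") for line in lines if line.rstrip("\n")}


def missing_characters(label_lines, dictionary):
    """Collect label characters that are absent from the dictionary."""
    missing = set()
    for line in label_lines:
        # Assumed label format: "<image path>\t<transcription>"
        parts = line.rstrip("\n").split("\t", 1)
        if len(parts) != 2:
            continue  # skip malformed lines
        missing.update(ch for ch in parts[1] if ch not in dictionary)
    return missing


# Hypothetical sample data; in practice, pass open(...) file handles instead.
dictionary = load_dictionary(["a\n", "b\n", "c\n"])
labels = ["img_1.jpg\tabc\n", "img_2.jpg\tabd\n"]
print(sorted(missing_characters(labels, dictionary)))  # ['d']
```

Any character reported here can never be predicted correctly, which lowers validation accuracy regardless of how much data is used.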
Hi @fadamsyah
Thank you for the comments. I see, so it was overfitting after all.
Actually, I was using 5000 images for training, and I increased the number to 80000. Now I get reasonable accuracy, though it is still not good enough.
I recommend trying with more than 100000 images for training.
Thank you so much!
br,
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
Hello
I'm trying to create a custom text recognition model with a custom dictionary, using over 4000 images (icdar2015) and following this.
During training the accuracy increases and the loss decreases; however, at the end of training, the accuracy drops very low, as shown below.
Here is the command I ran.
And here is the YAML file.
Could someone please give me advice on creating a good recognition model?
Thank you so much,
br,