clovaai / deep-text-recognition-benchmark

Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Apache License 2.0
3.69k stars 1.08k forks source link

Fine tuning problem! urgent #344

Open gitdeepheolp opened 2 years ago

gitdeepheolp commented 2 years ago

So i have trained from scratch using lmdb dataset from the repo then i tried to create my own dataset using trdg to add non latin characters and retrain the model. With the original dataset i got pretty good predictions but after i trained with the dataset that i created i got a very bad one. Please help ! Or should i train the model from scratch using non-latin characters ?

Another question is : after creating the dataset .mdb files i have to replace the datasets from validation folder also ?

This is confusing, we create the training model python3 create_lmdb_dataset.py --inputPath data/ --gtFile data/gt.txt --outputPath result/

But what about the validation datasets?

gitdeepheolp commented 2 years ago

Finally figured it out. You have to generate other images with trdg for validation. There's no mention of that or i missed it.

gitdeepheolp commented 2 years ago

So, i trained my model with english lmdb dataset then fine tuning with russian dataset and the model now recognize only the russian characters.....

MHasanlou1 commented 1 year ago

So, i trained my model with english lmdb dataset then fine tuning with russian dataset and the model now recognize only the russian characters.....

Could you please give me more detail? I trained this network for the Persian dataset from scratch, but the result is not good. How fine tunning on a network trained with a different dataset would be effective? Thanks

ftmasadi commented 1 year ago

Hello, I am also working on this topic. I hope I can communicate with you via email. Thanks