Canjie-Luo / MORAN_v2

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition
MIT License

Datasets used to train provided demo.pth model? #141

Open saketdingliwal opened 3 years ago

saketdingliwal commented 3 years ago

It would be great if the authors could describe the datasets used to train the demo.pth model for MORAN. Were only the synthetic datasets [Jaderberg et al.; Gupta et al. [15]] used for training, or were other datasets also used to fine-tune the model? I am asking specifically about the weights in the file demo.pth.

Canjie-Luo commented 3 years ago

Yes, only the synthesized images were used for training.

M. Jaderberg, K. Simonyan, A. Vedaldi, and A. Zisserman. Synthetic data and artificial neural networks for natural scene text recognition. In Advances in Neural Information Processing Systems Deep Learning Workshop (NIPS-W), 2014.

A. Gupta, A. Vedaldi, and A. Zisserman. Synthetic data for text localisation in natural images. In Proceedings of Computer Vision and Pattern Recognition (CVPR), pages 2315–2324, 2016.

Please refer to https://github.com/Canjie-Luo/MORAN_v2#data-preparation.