Open delveintodetail opened 4 years ago
SVTP has 645 images, if you carefully read each one, you will find there are around 20 images humans cannot recognize. How can this algorithm get 98.6%??
When seeing such so supervised improvement to the SOTA, you should carefully check your code.
Thank you for your reminding. Maybe there is something wrong with my code. I'll check it carefully sometime. As I am temporarily engaged in work, I am not very rich in time. There may be a delay. Sorry.
There are several papers that adopt transformer on OCR.
There are several papers that adopt transformer on OCR.
- NRTR: A no-recurrence sequence-to-sequence model for scene text recognition
- MASTER: Multi-Aspect Non-local Network for Scene Text Recognition
- A Simple and Strong Convolutional-Attention Network for Scene Text Recognition
Thanks very much, I will read later.
SVTP has 645 images, if you carefully read each one, you will find there are around 20 images humans cannot recognize. How can this algorithm get 98.6%??
When seeing such so supervised improvement to the SOTA, you should carefully check your code.