FangShancheng / ABINet

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

About rectifying the image first and then performing scene text recognition #60

Open HHeracles opened 2 years ago

HHeracles commented 2 years ago

To improve scene text recognition accuracy, some existing methods first rectify the text image (e.g., Iterative Image Rectification) and then recognize the text. ABINet does not rectify the image before recognition. Is that because rectification does not improve recognition accuracy? What do you think of this kind of method? Have you tried this idea?

FangShancheng commented 2 years ago

Actually, we did not try image rectification, since we mainly focus on language modeling problems. But I think additionally integrating rectification would be an effective approach if you just want to boost accuracy.
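
For readers who want to experiment with the rectify-then-recognize arrangement discussed in this thread, a minimal sketch of how a rectifier could sit in front of a recognizer might look like the following. This is not ABINet code and not the Iterative Image Rectification method: `shear_rectify`, `toy_recognizer`, and `recognize_with_rectification` are hypothetical names, the rectifier only undoes a known horizontal slant, and the recognizer is a stand-in stub. A real setup would replace both with a learned rectification module and the actual recognition model.

```python
from typing import Callable, List, Optional

# Assumption: a grayscale image is a list of rows of pixel intensities.
Image = List[List[float]]


def shear_rectify(image: Image, shear: float) -> Image:
    """Toy rectifier (hypothetical): undo a horizontal slant.

    Each output pixel (y, x) is sampled from (y, x + shear * y) using
    nearest-neighbour rounding; samples outside the image stay 0.
    """
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            src_x = round(x + shear * y)
            if 0 <= src_x < width:
                out[y][x] = image[y][src_x]
    return out


def toy_recognizer(image: Image) -> str:
    """Stand-in recognizer (hypothetical): reads an upright stroke as "I"."""
    width = len(image[0])
    has_upright_stroke = any(
        all(row[x] > 0.5 for row in image) for x in range(width)
    )
    return "I" if has_upright_stroke else "?"


def recognize_with_rectification(
    image: Image,
    recognizer: Callable[[Image], str],
    rectifier: Optional[Callable[[Image], Image]] = None,
) -> str:
    """Apply the optional rectifier first, then run the recognizer."""
    if rectifier is not None:
        image = rectifier(image)
    return recognizer(image)


# A slanted stroke that the stub recognizer cannot read without rectification.
slanted = [
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
without_rectifier = recognize_with_rectification(slanted, toy_recognizer)
with_rectifier = recognize_with_rectification(
    slanted, toy_recognizer, rectifier=lambda im: shear_rectify(im, 1.0)
)
```

Keeping the rectifier as an optional, separate callable means the recognizer is unchanged whether or not rectification is used, which makes it easy to compare accuracy with and without the extra step.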