MichalBusta / E2E-MLT

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
MIT License
291 stars 84 forks source link

3 times call to function net.forward_ocr in method process_boxes #77

Open AniketGurav opened 2 years ago

AniketGurav commented 2 years ago

Hi, in function process_boxes net.forward_ocr is called 3 times. I am not clear about it. those lines no are 270,276,381 in train.py

By reading paper, what I understand is the function process_boxes ocr the crops extracted by the Localization Module LM. Those crops are extracted from the 1. bounding box coordinate extracted by LM and 2.feature map from one of the layer of LM.

But I am not clear about 3rd ocr call on line 381 above..

I have referred Fig 3 of your paper https://arxiv.org/pdf/1801.09919.pdf for understanding.

MichalBusta commented 2 years ago

Hi Aniket, 3rd call is training on GT boxes. It can speed up training in early stage (since a prediction network does not produce proposals with good overlap)

in short we train on:

Hope it helps, Michal

AniketGurav commented 2 years ago

Thank you for reply