Open xwhkkk opened 1 year ago
The performance gains mainly come from the decoupling of visual and ID embeddings. In fact, the improvement is general and not limited to tiny or scale-changing objects. I highlighted those examples since R50-AOT-L usually fails in those cases.
Hello !
Thanks for sharing your great work!
In section 6.1, the paper mentioned the R50-DeAOT-L performs better than R50-AOT-L on tiny or scale-changing objects. I would like to know which module is beneficial to tiny or scale-changing objects.
Looking forward to your kind reply !