Thanks for your excellent work! I have a question about the code.
I thought VirConv-L uses early fusion. Why does the function "roi_grid_pool_mm" exist in the TEDMHead?
It is there only for code simplicity. If no image-only feature is provided, that feature is replaced by the early-fused feature. Please refer to lines 608 to 611 in ted_head.py.
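For anyone reading along, the fallback described in the answer can be sketched roughly as follows. This is not the code from ted_head.py; the function name `pick_mm_features` and the dictionary keys `image_only_features` and `fused_features` are hypothetical, chosen only to illustrate the idea that the multi-modal branch reuses the early-fused feature when no image-only feature is given.

```python
def pick_mm_features(batch_dict,
                     image_key="image_only_features",  # hypothetical key
                     fused_key="fused_features"):      # hypothetical key
    """Return the image-only features if present, else the early-fused ones."""
    image_feats = batch_dict.get(image_key)
    if image_feats is None:
        # No image-only feature provided: fall back to the early-fused feature,
        # so the multi-modal pooling path still has an input to work on.
        return batch_dict[fused_key]
    return image_feats


# Early-fusion-only setup: only the fused feature is present.
early_only = {"fused_features": [0.1, 0.2, 0.3]}
print(pick_mm_features(early_only))
```

Under this assumption, an early-fusion-only model can keep the same head code path, since both branches end up pooling from the same fused feature.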