clin1223 / VLDet

[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
Other
177 stars 11 forks source link

What's the meaning for alignment scores? #15

Open JiuqingDong opened 1 year ago

JiuqingDong commented 1 year ago

In section 3.2, The cost between image regions and words is defined as the alignment scores S = WR⊤ . The bipartite matching problem can then be efficiently solved by the off-the-shelf Hungarian Algorithm.

The input of the Hungarian algorithm is a cost matrix. So, that means alignment score S is the cost matrix? But, why did you call WR⊤ alignment score?