Closed hhaAndroid closed 9 months ago
Hi, we use the owlvit-base-patch16 ckpt, which is better than owlvit-base-patch32 for OVD.
As noted in our supplementary file, we use a score filtering threshold of 0.1 for owlvit, which is its default value for OVD.
We think these two aspects may be the reason for the performance difference (9.0 you reproduced vs 8.6 in our paper for FULL intra-scenario mAP). It is possible that specific choice of weight to load and threshold can impact the performance of owlvit on $D^3$. Actually, 0.05 or 0.1 may not be the best value and tuning it to be 0.06, 0.07 etc. may be better. We did not investigate this very detailedly as it does not affect the analysis in the paper. We are very thankful for pointing this out.
This issue is not active for some time so I'm closing it for now. Feel free to reopen it or open another one if any questions arises.
作者好,我直接跑了你提供的 owl-vit 代码,权重是 google/owlvit-base-patch32,但是结果和论文不一样,高不少,想请问原因可能是啥