wusize / ovdet

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
https://openaccess.thecvf.com/content/CVPR2023/papers/Wu_Aligning_Bag_of_Regions_for_Open-Vocabulary_Object_Detection_CVPR_2023_paper.pdf
Other
176 stars 4 forks source link

About coco_clip_hand_craft_attn12.npy #9

Closed wwiwush closed 1 year ago

wwiwush commented 1 year ago

Thanks for this nice work! I'd like to know how was coco_clip_hand_craft_attn12.npy generated and the difference between coco_clip_hand_craft_attn12.npy and coco_clip_hand_craft.npy. Thank you!

wusize commented 1 year ago

Hi, thanks for your interest! This to alleviate overfitting when training COCO. We use the output of the last (12th) attention layer for classification. Details are here. You can also refer to the last paragraph of S1 section in the supplementary material.