[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
Does the Mean Accuracy in Table 1 refer to evaluation on ovcoco(65) or coco(80)? #1
Open
eternaldolphin opened 9 months ago
coco(80)