altndrr / vic

Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
https://alessandroconti.me/papers/2306.00917
MIT License

What is the CLIP upper bound? How did you get the model? #19

Open PowerKaly opened 1 month ago

PowerKaly commented 1 month ago

Thank you for your work. I have some questions and hope you can answer them despite your busy schedule. What is the CLIP upper bound? How did you get the model?

We consider three main groups of baselines for our comparisons. The most straightforward baselines consist of using CLIP with large vocabularies, such as WordNet [41] (117k names) or the English Words (234k names [16]). As an upper bound, we also consider CLIP with the perfect vocabulary, i.e. the ground-truth names of the target dataset (CLIP upper bound). Due to lack of space, we only report results for CLIP with ViT-L [13].

altndrr commented 1 month ago

As you reported, "As an upper bound, we also consider CLIP with the perfect vocabulary, i.e. the ground-truth names of the target dataset (CLIP upper bound).", which means we test CLIP performance on the standard "generalized" zero-shot setting. This means we have a manually-annotated pre-defined list of class names per dataset, not a generated one per image as in the other approaches. Let me know if it is clearer now.