ljwztc / CLIP-Driven-Universal-Model

[ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.
Other
521 stars 58 forks source link

Image embedding space - Fig. 6 #62

Closed DrMachado closed 1 month ago

DrMachado commented 6 months ago

Hello and thank you for this great work ! It is unclear to me how to extract the CLIP label encoding for downstream tasks as shown in Fig. 6 of the manuscript, with the t-SNE plot of the embedding space. Thank you for your time.

ljwztc commented 5 months ago

After obtaining the feature map, you can scale up to the input size. Then you can find the feature vector for each category with ground truth.