Closed DrMachado closed 1 month ago
Hello, and thank you for this great work! It is unclear to me how to extract the CLIP label encoding for downstream tasks, as shown in the t-SNE plot of the embedding space in Fig. 6 of the manuscript. Thank you for your time.
After obtaining the feature map, upsample it to the input size. You can then use the ground-truth labels to pick out the feature vectors that belong to each category.
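In case it helps, here is a rough sketch of those two steps, not the repository's actual code. It assumes the feature map is a NumPy array of shape `(C, h, w)` and the ground truth is an integer label map of shape `(H, W)`. The function names, the `ignore_index=255` convention, the nearest-neighbour upsampling, and the per-class averaging are all my own assumptions for illustration; in a PyTorch pipeline you would more likely upsample with bilinear interpolation.

```python
import numpy as np


def upsample_nearest(feat, out_h, out_w):
    """Nearest-neighbour upsampling of a (C, h, w) map to (C, out_h, out_w)."""
    _, h, w = feat.shape
    # For each output pixel, pick the index of the source pixel it falls in.
    rows = np.arange(out_h) * h // out_h
    cols = np.arange(out_w) * w // out_w
    return feat[:, rows[:, None], cols[None, :]]


def per_class_features(feat, label, ignore_index=255):
    """Return {class_id: mean feature vector} using the ground-truth label map.

    Averaging per class is an assumption; you could also keep every pixel's
    vector and subsample them instead.
    """
    channels = feat.shape[0]
    height, width = label.shape
    up = upsample_nearest(feat, height, width).reshape(channels, -1)  # (C, H*W)
    flat = label.reshape(-1)
    vectors = {}
    for cls in np.unique(flat):
        if cls == ignore_index:  # hypothetical "unlabeled" id, skipped
            continue
        vectors[int(cls)] = up[:, flat == cls].mean(axis=1)
    return vectors
```

The resulting vectors, collected over many images, could then be stacked into an `(N, C)` array and passed to a t-SNE implementation to produce a plot like the one in Fig. 6.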