Open feemthan opened 3 months ago
Hi @feemthan, thank you for your interest. Could you elaborate how you are obtaining the distance values? Does the "distance" you print refer to "cosine similarity"? Cosine similarity should return a max similarity of 1, but the distance values you have printed are around 3. Could you check that the embeddings you are using are normalized?
Hello Team,
Thank you for your amazing work on this model. I was able to reproduce your remarkable results. I am looking to contribute and develop downstream inference using faiss but I am running to a lot of issues. The cosine similarity gives incorrect results.
08/03/2024 22:03:10 - INFO - main - Setup model... 08/03/2024 22:03:11 - INFO - main - Using CLIP pretrained weights... 08/03/2024 22:03:17 - INFO - main - Setup model done! Loaded existing embeddings. 08/03/2024 22:03:17 - INFO - main - Loading metadata... 08/03/2024 22:03:17 - INFO - main - Metadata loaded Top 5 results for 'a woman eating':
I use faiss.IndexFlatIP which is the inner product. How do I make better predictions on the MSRVTT dataset?