-
Could you explain how you concluded that the difference between the similarity score of AI-generated images with text and that of real images with text would increase when mixing AI-generated and real…
-
I met the same error as https://github.com/rom1504/clip-retrieval/issues/345 when I used `clip-retrieval inference` command to extract images and corresponding texts features, my command is like foll…
-
Hello, can you give the checkpoint of Image Text Retrieval, your link in VinVL_MODEL_ZOO. md is wrong, thank you!
QC-LY updated
2 years ago
-
#### **Data we can collect using public resources**
1. **Engagement Metrics**:
- Data: Likes, comments, and shares for posts/accounts.
- Source: Instagram Graph API (limited to authenticate…
-
### Feature request
I propose adding a loss calculation for QFormer training in the BLIP-2 model. Implementing this feature would allow fine-tuning the QFormer and language models for image-text retr…
-
Can you share the img2text model.pth. I want to use this work for inference on other datasets.
-
I want to perform a zero shot retrieval evaluation on Flickr 30K for a COCO fine-tuned model. , what should I do?
-
# URL
- https://arxiv.org/abs/2411.02571
# Authors
- Sheng-Chieh Lin
- Chankyu Lee
- Mohammad Shoeybi
- Jimmy Lin
- Bryan Catanzaro
- Wei Ping
# Abstract
- State-of-the-art retrieval mod…
-
### Checklist
- [X] I added a very descriptive title to this issue.
- [X] I included a link to the documentation page I am referring to (if applicable).
### Issue with current documentation:
How to…
-
Currently, we use text embeddings. This is fine for textual documents, while it present obvious drawbacks for documents containing non-textual content (images, graphs, schemes, …).
An alternative, is…