-
### Checklist
- [X] I added a very descriptive title to this issue.
- [X] I included a link to the documentation page I am referring to (if applicable).
### Issue with current documentation:
How to…
-
Hello all @jhyuklee @rajeshkp @donghyeonk @cbhyphen @wonjininfo. I want to use biobert as text encoder in a multimodal architecture for report retrieval task for an image? Can you suggest me which ver…
-
Apologies if this is a n00b question. I'm trying out the multimodal embedding examples here:
https://lancedb.github.io/lancedb/embeddings/default_embedding_functions/
Retrievals aren't too bad for C…
shaqq updated
3 months ago
-
HI!
I am trying to use the eval_coco_retrieval script. However, I am running into the following error:
File "evaluation/eval_coco_retrieval.py", line 33, in
from multimodal_bert.datasets…
-
I am a student in Toronto learning about multimodal models and multimodal retrieval.
Can embeddings be extracted from your models ?
I would like to compare retrieval results from your model to CLIP.…
-
I want to use images as queries for retrieval. Can I modify it by directly replacing the model in embedding_func with a multimodal large model? If not, please tell me what needs to be changed. Thank y…
-
https://zhizhou-yu.github.io/2020/06/14/Multimodal-Papers-Reading-Notes.html#reference
多模态学习相关的论文阅读,包含多模态表示学习(Multimodal Representation Learning)、多模态检索(Multimodal Retrieval)、多模态匹配(Text-image Matchi…
-
Hi JUNJIE. In "train.bash," I found that you locked the text tower and only trained the vision tower. The weights of the text tower (BGE) are already pre-trained (BAAI/bge-base-en-v1.5), so during the…
-
![image](https://user-images.githubusercontent.com/129525/66028009-847d1100-e53f-11e9-80a5-9df94d0f25f0.png)
https://arxiv.org/pdf/1909.12932.pdf
silky updated
5 years ago
-
I am currently working on a project that involves finetuning Visualized BGE. I have been able to successfully use the pretrained model, but now I would like to further finetune it for my specific use …