Open CthulhuAIFrenzy opened 4 months ago
May I ask what your specific downstream task is? For instance, what are the modalities of the query and the candidates? You can fine-tune Visualized BGE with contrastive learning, and I can provide you with our Stage-2 training code, although it has not been fully cleaned up.
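For readers unfamiliar with the contrastive objective mentioned above, here is a minimal sketch of an in-batch InfoNCE-style loss in plain Python. It is not the Stage-2 training code, and it does not use the Visualized BGE API: the function names, the temperature of 0.05, and the use of in-batch negatives are all assumptions made for illustration. In practice the embeddings would come from the model's query and candidate encoders, and the loss would be computed in a deep learning framework so that gradients flow back into the model.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def l2_normalize(v):
    """Scale a non-zero vector to unit length so dot products become cosines."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def info_nce_loss(query_embs, cand_embs, temperature=0.05):
    """Mean in-batch contrastive loss (hypothetical helper).

    cand_embs[i] is treated as the positive for query_embs[i]; every other
    candidate in the batch serves as a negative. The temperature value is an
    assumption, not a setting taken from the Visualized BGE paper.
    """
    queries = [l2_normalize(q) for q in query_embs]
    cands = [l2_normalize(c) for c in cand_embs]
    total = 0.0
    for i, q in enumerate(queries):
        logits = [dot(q, c) / temperature for c in cands]
        # Log-sum-exp with the max subtracted for numerical stability.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy with the matching candidate as the target class.
        total += log_denom - logits[i]
    return total / len(queries)


if __name__ == "__main__":
    # Toy 2-D embeddings standing in for fused image+text embeddings.
    queries = [[1.0, 0.0], [0.0, 1.0]]
    aligned = [[1.0, 0.0], [0.0, 1.0]]
    swapped = [[0.0, 1.0], [1.0, 0.0]]
    print(info_nce_loss(queries, aligned))  # close to zero
    print(info_nce_loss(queries, swapped))  # large, since positives are mismatched
```

The loss is small when each query is closest to its own candidate and grows when a negative in the batch scores higher, which is the signal that drives fine-tuning.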
I want to try multimodal fusion retrieval for product search, and also use it to rerank the top-N retrieved candidates so that rank-1 accuracy improves.
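As a rough illustration of that reranking setup, the sketch below reorders a list of top-N candidates by cosine similarity to the query embedding and measures rank-1 accuracy. The helper names and the `(candidate_id, embedding)` data layout are hypothetical, and the embeddings are assumed to be non-zero vectors already produced by a multimodal encoder; nothing here calls the Visualized BGE API.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    num = sum(a * b for a, b in zip(u, v))
    den = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return num / den


def rerank(query_emb, candidates):
    """Return candidate ids sorted by descending similarity to the query.

    candidates is a list of (candidate_id, embedding) pairs, e.g. the top-N
    results of a first-stage retriever (hypothetical layout).
    """
    scored = [(cosine(query_emb, emb), cid) for cid, emb in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [cid for _, cid in scored]


def rank1_accuracy(rankings, gold_ids):
    """Fraction of queries whose top-ranked candidate is the gold item."""
    hits = sum(1 for ranked, gold in zip(rankings, gold_ids) if ranked and ranked[0] == gold)
    return hits / len(gold_ids)


if __name__ == "__main__":
    top_n = [("a", [1.0, 0.0]), ("b", [0.6, 0.8]), ("c", [0.0, 1.0])]
    ranked = rerank([0.0, 1.0], top_n)
    print(ranked)
    print(rank1_accuracy([ranked], ["c"]))
```

Comparing `rank1_accuracy` before and after reranking on a held-out set of queries is one simple way to check whether the fine-tuned model helps.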
I can provide you with the original core training code for the stage 2 training process, which corresponds to multi-modal training in our paper. If needed, feel free to reach out to zhoujunjie [at] bupt [dot] edu [dot] cn.
I am currently working on a project that involves finetuning Visualized BGE. I have been able to successfully use the pretrained model, but now I would like to further finetune it for my specific use case.
Best regards,