X-PLUG / mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Apache License 2.0
213 stars, 17 forks

Finetuned weights for MSRVTT text-video retrieval #11

Open: nguyenquangtan opened this issue 12 months ago

nguyenquangtan commented 12 months ago

Hi, this is really great work, thank you for sharing the code. Could you please release the MSRVTT finetuned weights for the text-video retrieval task? That would be really helpful for me.