OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Apache License 2.0
1.33k stars, 84 forks

Request to release multimodal finetuning for Internvideo2 #149

Open. Varun-GP opened this issue 2 months ago.

Varun-GP commented 2 months ago

I would like to request that the authors release the finetuning code for the multimodal InternVideo2 model: https://github.com/OpenGVLab/InternVideo/tree/main/InternVideo2/multi_modality#finetuning

Andy1621 commented 1 month ago

Hi! For retrieval and QA, you can use the finetuning code in UMT.