X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
https://www.modelscope.cn/studios/damo/mPLUG-Owl
MIT License
2.25k stars 171 forks source link

How to finetune with video input? #122

Closed pku-tyh closed 1 year ago

pku-tyh commented 1 year ago

I only find how to finetune with text-only tasks and image-text tasks. Could you please give me a guide for finetuning with video tasks?