X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
https://www.modelscope.cn/studios/damo/mPLUG-Owl
MIT License
2.25k stars 171 forks source link

video checkpoint #156

Open Xiuyuan-Chen opened 1 year ago

Xiuyuan-Chen commented 1 year ago

I wonder how the video checkpoint on hugging face was obtained? Did you use only picture data or video data as in the paper?thx!