X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
https://www.modelscope.cn/studios/damo/mPLUG-Owl
MIT License
2.25k stars 171 forks source link

论文中图与描述(代码)不一致 #45

Closed WooKimm closed 1 year ago

WooKimm commented 1 year ago

文中 Fig2 里 stage2 画的是Abstractor冻住了,但是论文描述和代码里似乎都是训练的?

LukeForeverYoung commented 1 year ago

Sorry, visual abstractor should not be trained during the instruction finetuning stage. The code provided in this repository has been updated, and we will correct the description in the paper as soon as possible. It was an early experimental attempt to train visual abstractor during instruction finetuning, but later we found that setting requires_grad to False can achieve similar performance.