Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
https://minigpt-4.github.io
BSD 3-Clause "New" or "Revised" License
25.4k stars 2.91k forks source link

hi,can you release 7B ckpt in first stage? #116

Open wanghao-007 opened 1 year ago

wanghao-007 commented 1 year ago

hi,can you release 7B ckpt in first stage?thx

WangRongsheng commented 1 year ago

我不懂你想要问什么,但是我觉得 #81 可能对你有帮助

wanghao-007 commented 1 year ago

哈哈抱歉,我详细描述一下我的问题。我使用这里公布的MiniGPT-4 checkpoint(https://drive.google.com/file/d/1u9FRRBB3VovP1HxCAlpD9Lw4t4P6-Yq8/view?usp=share_link)直接做第二阶段的训练时出现这个维度错误:size mismatch for llama_proj.weight: copying a param with shape torch.Size([5120, 768]) from checkpoint, the shape in current model is torch.Size([4096, 768]). 我猜测这边给出的模型权重是13B llama训练的,所以想要7B训练的MiniGPT-4 checkpoint。谢谢。

TsuTikgiau commented 1 year ago

Hello! Sure, the first stage of 7B can be downloaded from here

Zhudogsi commented 1 year ago

哈哈抱歉,我详细描述一下我的问题。我使用这里公布的MiniGPT-4 checkpoint(https://drive.google.com/file/d/1u9FRRBB3VovP1HxCAlpD9Lw4t4P6-Yq8/view?usp=share_link)直接做第二阶段的训练时出现这个维度错误:size%E7%9B%B4%E6%8E%A5%E5%81%9A%E7%AC%AC%E4%BA%8C%E9%98%B6%E6%AE%B5%E7%9A%84%E8%AE%AD%E7%BB%83%E6%97%B6%E5%87%BA%E7%8E%B0%E8%BF%99%E4%B8%AA%E7%BB%B4%E5%BA%A6%E9%94%99%E8%AF%AF%EF%BC%9Asize) mismatch for llama_proj.weight: copying a param with shape torch.Size([5120, 768]) from checkpoint, the shape in current model is torch.Size([4096, 768]). 我猜测这边给出的模型权重是13B llama训练的,所以想要7B训练的MiniGPT-4 checkpoint。谢谢。

run demo就会出现这个问题,请问您这个问题解决了吗?怎么解决的呢?