Training script for InternLM-XComposer2-VL -> InternLM-XComposer2

InternLM / InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output


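How should I train InternLM-XComposer2 starting from InternLM-XComposer2-VL? Should I use the code at https://github.com/InternLM/InternLM-XComposer/blob/main/finetune ? If so, this is what I tried:

I set the pretrained path to the path of the VL version.
2.1. With image size set to 224, the training code fails with an error saying the model dimensions do not match.
2.2. With image size set to 490, training is very slow. (The data is likewise image-text composition data, with multiple images in a single sentence.)

I am not sure why the two versions need different image sizes, or which code should be used to instruction fine-tune the VL model. Thanks 🙏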
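For reference, here is a rough sketch of why the image size could produce both symptoms. It assumes a ViT-style vision encoder with a patch size of 14 and one extra class token; both are my assumptions and are not confirmed in this thread. The helper names `vit_token_count` and `tokens_per_sample` are hypothetical and are not part of the finetune code.

```python
def vit_token_count(image_size: int, patch_size: int = 14) -> int:
    """Number of vision tokens for a square image: patches plus one class token.

    patch_size=14 is an assumption about the vision encoder, not a confirmed value.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be a multiple of patch_size")
    patches_per_side = image_size // patch_size
    return patches_per_side * patches_per_side + 1


def tokens_per_sample(num_images: int, image_size: int, patch_size: int = 14) -> int:
    """Total vision tokens produced by the encoder for one sample with several images."""
    return num_images * vit_token_count(image_size, patch_size)


if __name__ == "__main__":
    for size in (224, 490):
        print(size, vit_token_count(size), tokens_per_sample(4, size))
```

Under these assumptions, a checkpoint whose position embeddings were sized for 490 would not match a model built for 224, which would be consistent with the dimension error in 2.1. Each image at 490 also yields several times more vision tokens than at 224, so samples with multiple images would be noticeably slower to process, as in 2.2.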
