InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
1.92k
stars
121
forks
source link
Can I train a VL model based on Intern20B from scratch? #240
Open
lllllllll-3154 opened 3 months ago
Will you release the pretrain script for internlm-vl?