InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
How well does InternLM-XComposer2-4KHD recognize and understand Chinese text in images? #293
Open
lhanchao777 opened 2 months ago
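The datasets in the InternLM-XComposer2-4KHD paper all appear to be English. How well does the model recognize and understand images containing Chinese text? Has this been evaluated?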