InternLM / InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

How well does InternLM-XComposer2-4KHD recognize and understand Chinese text in images? #293

Open lhanchao777 opened 2 months ago

lhanchao777 commented 2 months ago

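The datasets in the InternLM-XComposer2-4KHD paper all appear to be English. How well does the model recognize and understand Chinese text in images? Has this been evaluated?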