InternLM / InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
1.92k stars 121 forks source link

SFT阶段的数据集会开源吗? #251

Closed murray-z closed 2 months ago

myownskyW7 commented 2 months ago

Please check the report of InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD for dataset details https://arxiv.org/pdf/2404.06512.pdf