InternLM / InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
1.91k stars 120 forks source link

Caption data of ShareGPT4Video dataset #330

Closed Marlod390 closed 2 weeks ago

Marlod390 commented 3 weeks ago

Dear authors,

thank your for your great work. I want to check the text content of your ShareGPT4Video dataset, but I only found videos on huggingface. Where can I find the corresponding captions?