ShareGPT4Omni / ShareGPT4Video

[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
https://sharegpt4video.github.io/
1.26k stars 44 forks source link

Great work! What is the difference between these two models? #18

Closed WilTay1 closed 4 months ago

WilTay1 commented 4 months ago

image

xiaoachen98 commented 4 months ago

image

We use LLaVA-Next-8B for our ShareGPT4Video-8B for easy reproduction. We choose InternLM-XComposer2-4KHD which can handle a wide range of resolutions and aspect ratios of images to perform a general captioner for various videos.