Open Luodian opened 2 weeks ago
Cross pin for explaination on video data.
Q: About video data? A: It's to be released in @ZhangYuanhan-AI next version of a more powerful video model. Currently we released the data yaml used in onevision stage at onevision.yaml.
You can checkout the three subsets video data, (1) sharegpt4video_255000.json (checkout sharegpt4video) (2) 0718_0_30_s_academic_mc_v0_1_all.json (to be released) (3) academic_source_30s_v1_all.json (to be released).
Checkout here to see the three yamls.
https://github.com/LLaVA-VL/LLaVA-NeXT/tree/main/scripts/train