LLaVA-VL / LLaVA-NeXT

Apache License 2.0
2.4k stars 167 forks source link

[Common Issues] Releasing LLaVA-OneVision data yaml files in three stages (mid/single-image/onevision) #199

Open Luodian opened 2 weeks ago

Luodian commented 2 weeks ago

Checkout here to see the three yamls.

https://github.com/LLaVA-VL/LLaVA-NeXT/tree/main/scripts/train

Luodian commented 2 weeks ago

Cross pin for explaination on video data.

https://github.com/LLaVA-VL/LLaVA-NeXT/issues/130

Luodian commented 2 weeks ago

Q: About video data? A: It's to be released in @ZhangYuanhan-AI next version of a more powerful video model. Currently we released the data yaml used in onevision stage at onevision.yaml.

You can checkout the three subsets video data, (1) sharegpt4video_255000.json (checkout sharegpt4video) (2) 0718_0_30_s_academic_mc_v0_1_all.json (to be released) (3) academic_source_30s_v1_all.json (to be released).