llava_v1_5_mix665k dataset

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

https://llava.hliu.cc

Apache License 2.0


llava_v1_5_mix665k dataset #835

Open taltlusty opened 10 months ago

taltlusty commented 10 months ago

Describe the issue

Hello. Looking at the dataset list, which dataset do the prompts with an empty "model" field belong to? For example:

"id": "wgByO4Y_0", "model": "",

Thanks

aneet-javis commented 10 months ago

@taltlusty Where did you get this dataset from? I couldn't find it in playground/data.

taltlusty commented 10 months ago

Thanks @aneet-javis. This is the published dataset for fine-tuning: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json
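
For readers who want to look at the entries with an empty "model" field themselves, here is a minimal sketch. It assumes the JSON file linked above has been downloaded locally and is a single list of objects carrying the "id" and "model" keys shown in the example; the helper names (load_records, count_by_model, empty_model_ids) are hypothetical and not part of the LLaVA codebase.

```python
import json
from collections import Counter


def load_records(path):
    """Load the dataset, assumed to be one JSON list of dict entries."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def count_by_model(records):
    """Count entries per 'model' value; entries without the key are
    grouped under the placeholder '<missing>'."""
    return Counter(r.get("model", "<missing>") for r in records)


def empty_model_ids(records):
    """Return the ids of entries whose 'model' field is an empty string."""
    return [r.get("id") for r in records if r.get("model") == ""]


# Example usage (the local path is an assumption):
# records = load_records("llava_v1_5_mix665k.json")
# print(count_by_model(records).most_common())
# print(empty_model_ids(records)[:10])
```

Printing a few of the returned ids together with their conversations is usually enough to judge by eye what kind of source the entries come from.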

CrazyBrick commented 9 months ago

Thanks @aneet-javis This is the published dataset for finetuning: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K/blob/main/llava_v1_5_mix665k.json

I think I have seen this discussed in another issue: someone there added some of the plain-text Q&A from llava_v1_5_mix665k to their custom (image-based Q&A) dataset, and it can improve the fine-tuning results.

zengxingchen commented 6 months ago

Same here. I also found that some of the ocr_vqa data do not exist in the downloaded data.

421zuoduan commented 5 months ago

Same here. Some of the ocr_vqa data do not exist.
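
One way to quantify the missing ocr_vqa files mentioned above is to compare the image paths referenced in the JSON against the files on disk. The sketch below makes two assumptions: each image-based entry stores a relative path under an "image" key, and the first path component names the source dataset (for example ocr_vqa). The helper names and the image root directory are hypothetical.

```python
from collections import Counter
from pathlib import Path


def missing_images(records, image_root):
    """Return the relative image paths referenced by the records that are
    not present as files under image_root. Entries without an 'image'
    key (assumed to be text-only) are skipped."""
    root = Path(image_root)
    missing = []
    for record in records:
        rel_path = record.get("image")
        if rel_path and not (root / rel_path).is_file():
            missing.append(rel_path)
    return missing


def missing_by_source(missing):
    """Group missing paths by their first path component."""
    return Counter(Path(p).parts[0] for p in missing)


# Example usage (records loaded from the JSON; the root is an assumption):
# missing = missing_images(records, "playground/data")
# print(missing_by_source(missing))
```

A per-source count makes it easy to see whether the gaps are limited to ocr_vqa or also affect other image folders.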