LLaVA-VL / LLaVA-NeXT

1.01k stars 55 forks source link

Question about M4-Instruct datasets #89

Open syspider opened 2 days ago

syspider commented 2 days ago

Thank your for your kindly release!

But when i looking at the annotations of M4-Instruct, the FIRST sample just quite confused me. Here is the snapshot: image

The human and GPT value seem to be wrong. Obviously it should be "human value" first and giving an instruction with multiple images. But in this sample, instruction is given by GPT, and answer is given by human with images.

Looking forward to your reply.

syspider commented 2 days ago

All the samples seem to have the same problem when the data source is "twitter_post"