TideDra / VL-RLHF

A RLHF Infrastructure for Vision-Language Models
Apache License 2.0

[BUG] Value Error noqa:E501 #13

Open hxhcreate opened 1 month ago

hxhcreate commented 1 month ago

When I use dpo_llava to fine-tune on a custom DPO dataset, I run into the following error after several steps:

ValueError: The input provided to the model are wrong. The number of image tokens is 3 while the number of image given to the model is 4. This prevents correct indexing and breaks batch generation.

How can this happen? I have checked that my dataset strictly follows the required format.
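For reference, the error compares the number of image placeholder tokens in the text with the number of images passed in. A minimal sketch of how the raw dataset could be double-checked for that condition is below. It assumes a LLaVA-style `<image>` placeholder and hypothetical field names (`prompt`, `images`); neither is taken from the VL-RLHF code, so adjust them to the actual dataset schema.

```python
from typing import Any, Dict, Iterable, List, Tuple

# Assumption: the placeholder string is "<image>" (LLaVA-style).
IMAGE_TOKEN = "<image>"


def count_image_tokens(text: str, image_token: str = IMAGE_TOKEN) -> int:
    """Count non-overlapping occurrences of the image placeholder in text."""
    return text.count(image_token)


def find_mismatched_samples(
    samples: Iterable[Dict[str, Any]],
    text_key: str = "prompt",    # hypothetical field name
    images_key: str = "images",  # hypothetical field name
) -> List[Tuple[int, int, int]]:
    """Return (index, n_image_tokens, n_images) for every inconsistent sample."""
    mismatches = []
    for idx, sample in enumerate(samples):
        n_tokens = count_image_tokens(sample[text_key])
        images = sample.get(images_key) or []
        # Accept a single image path given as a bare string.
        if isinstance(images, str):
            images = [images]
        n_images = len(images)
        if n_tokens != n_images:
            mismatches.append((idx, n_tokens, n_images))
    return mismatches


if __name__ == "__main__":
    demo = [
        {"prompt": "<image>\nDescribe the picture.", "images": ["a.jpg"]},
        {"prompt": "Describe the picture.", "images": ["b.jpg"]},  # missing token
    ]
    for idx, n_tokens, n_images in find_mismatched_samples(demo):
        print(f"sample {idx}: {n_tokens} image token(s), {n_images} image(s)")
```

This only inspects the raw samples. It would not catch a placeholder that is lost later in preprocessing, for example if a long prompt were truncated to a maximum length before reaching the model; that is a possible cause worth ruling out, not something confirmed here.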