Great work, thanks! I see you convert LLAVA format, that you require, back to the original format that is required by Qwen. Can you add support for datasets that already come in the correct format?
I could make it but, lot of multi-modal dataset uses the format from llava so I'm using the same format.
It's better for ask the gpt for the conversion code.
Great work, thanks! I see you convert LLAVA format, that you require, back to the original format that is required by Qwen. Can you add support for datasets that already come in the correct format?
"messages": [ { "role": "system", "content": [{"type": "text", "text": system_message}], }, { "role": "user", "content": [ {"type": "text", "text": question}, {"type": "image", "image": image_path} ] }, { "role": "assistant", "content": [ {"type": "text", "text": answer} ] } ] }