2U1 / Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL series by Alibaba Cloud.
Apache License 2.0
84 stars 6 forks source link

Dataset Format Support #9

Open yfvl opened 1 week ago

yfvl commented 1 week ago

Great work, thanks! I see you convert LLAVA format, that you require, back to the original format that is required by Qwen. Can you add support for datasets that already come in the correct format?

"messages": [ { "role": "system", "content": [{"type": "text", "text": system_message}], }, { "role": "user", "content": [ {"type": "text", "text": question}, {"type": "image", "image": image_path} ] }, { "role": "assistant", "content": [ {"type": "text", "text": answer} ] } ] }

2U1 commented 1 week ago

I could make it but, lot of multi-modal dataset uses the format from llava so I'm using the same format. It's better for ask the gpt for the conversion code.