zjysteven / lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, qwen-vl, qwen2-vl, phi3-v etc.
Apache License 2.0
170 stars 22 forks source link

LlaVa interleave for AutoCompletion #44

Closed sm745052 closed 2 weeks ago

sm745052 commented 3 weeks ago

Hi !! We were trying to LORA finetune LLaVa interleave for a autocompletion task on a dataset (DialogCC) that might contain many images (>10) per conversation.

zjysteven commented 3 weeks ago
zjysteven commented 2 weeks ago

Closing for now. Feel free to reopen if there are more questions.