any plans to support vision models?

unslothai / unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

https://unsloth.ai

Apache License 2.0

18.58k stars 1.3k forks source link

any plans to support vision models? #1149

Closed reza8iucs closed 4 days ago

reza8iucs commented 1 month ago

I am looking for libraries that support fine tuning vision models like Llama 2.3 Vision or Phi-3-vision. Do you have any plans to support these multimodal models in future? if not , any recommendations?

danielhanchen commented 1 month ago

Working on it!!

aeltorio commented 1 month ago

@danielhanchen thank you

shimmyshimmer commented 4 days ago

@reza8iucs @aeltorio @Nazzaroth2 @Any-Winter-4079

Hey guys apologies for the delays. Vision models are now supported in Unsloth! Please update Unsloth :)

Read our blogpost: https://unsloth.ai/blog/vision Tweet: https://x.com/UnslothAI/status/1859667930075758793 GitHub post: https://github.com/unslothai/unsloth/releases/tag/November-2024 Model uploads: https://huggingface.co/collections/unsloth/vision-multimodal-models-673eb9908fc2cb3deebd2fa3