llama3.2 vision models - Githubissues

Is your feature request related to a problem? Please describe. Llama3.2 was released, and as it has multimodal support would be great to have it in LocalAI

Describe the solution you'd like

Describe alternatives you've considered

Additional context llama.cpp have several issues wrt multimodal capabilities:

https://github.com/ggerganov/llama.cpp/issues/9643
https://github.com/ggerganov/llama.cpp/issues/8010

vLLM has already added support for it in https://github.com/vllm-project/vllm/pull/8811

mudler / LocalAI

llama3.2 vision models #3669