mudler / LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
https://localai.io
MIT License
26.42k stars 1.98k forks source link

llama3.2 vision models #3669

Open mudler opened 2 months ago

mudler commented 2 months ago

Is your feature request related to a problem? Please describe. Llama3.2 was released, and as it has multimodal support would be great to have it in LocalAI

Describe the solution you'd like

Describe alternatives you've considered

Additional context llama.cpp have several issues wrt multimodal capabilities:

vLLM has already added support for it in https://github.com/vllm-project/vllm/pull/8811

See also:

briansan commented 2 months ago

+1