xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
https://inference.readthedocs.io
Apache License 2.0
4.94k stars 391 forks source link

feat: Add support for Llama 3.2-Vision models #2376

Open vikrantrathore opened 1 day ago

vikrantrathore commented 1 day ago

This pull request introduces support for the Llama 3.2-Vision collection of multimodal large language models (LLMs) within Xinference. These models bring the capability to process both text and image inputs, expanding the potential for diverse applications.

Key Changes:

This pull request adds support for the Llama 3.2-Vision collection of multimodal LLMs for both the transformers and vllm engines.