HabanaAI / vllm-fork

A high-throughput and memory-efficient inference and serving engine for LLMs
https://docs.vllm.ai
Apache License 2.0

Update SynapseAI version in README & Dockerfile #390

Closed — kzawora-intel closed this 1 week ago
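As context for the kind of change this PR describes, below is a minimal sketch of how a SynapseAI version bump typically appears in a Gaudi-oriented Dockerfile. The base-image path and all version numbers here are hypothetical placeholders, not the actual values from PR #390; the real values live in the repository's Dockerfile and README.

```dockerfile
# Hypothetical example of a SynapseAI version bump in a Gaudi Dockerfile.
# The registry path and the 1.XX.Y placeholders are assumptions for illustration,
# not the values changed by this PR.
FROM vault.habana.ai/gaudi-docker/1.XX.Y/ubuntu22.04/habanalabs/pytorch-installer-2.X.X:latest

# A version bump like the one in this PR would update the tag above
# (and the matching version string documented in the README) in lockstep,
# so that the documented SynapseAI release matches the image actually built.
WORKDIR /workspace/vllm
COPY . .
RUN pip install -e .
```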