vectorch-ai / ScaleLLM

A high-performance inference system for large language models, designed for production environments.
https://docs.vectorch.com/
Apache License 2.0
317 stars, 24 forks

LoRA: QLoRA/S-LoRA: Serving thousands of LoRA adapters #166

Status: Open. Opened by guocuimi 2 months ago.