LLMServe / DistServe

Disaggregated serving system for Large Language Models (LLMs).
Apache License 2.0
114 stars 9 forks source link