sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.
https://sglang.readthedocs.io/en/latest/
Apache License 2.0
5.21k stars 369 forks source link

Development Roadmap (2024 Q4) #1487

Open Ying1123 opened 11 hours ago

Ying1123 commented 11 hours ago

Here is the development roadmap for 2024 Q4. Contributions and feedback are welcome (Join Weekly Development Meeting). Previous 2024 Q3 roadmap can be found in #634.

Performance

Parallelism

Hardware Coverage

Model Coverage

LoRA support

Quantization

@zhyncs @ispobock

Server API

Observability

Others

fengyang95 commented 8 hours ago

Are there any plans to optimize long context latency?