sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.
https://sgl-project.github.io/
Apache License 2.0
6.27k stars, 545 forks

Fix mixed chunked prefill in overlap mode #2158

Closed by merrymercy 4 days ago

merrymercy commented 4 days ago