issues
search
sgl-project
/
sglang
SGLang is a fast serving framework for large language models and vision language models.
https://sgl-project.github.io/
Apache License 2.0
6.27k
stars
545
forks
source link
Fix mixed chunked prefill in overlap mode
#2158
Closed
merrymercy
closed
4 days ago
merrymercy
commented
4 days ago
Fix the one token delay in mixed chunked prefill
Fix the logprob return value in mixed chunked prefill by temporarily disable it