sgl-project sglang issues

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

https://sgl-project.github.io/

Apache License 2.0

6.22k stars 531 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

rust e2e test

#2184 ByronHsu opened 7 minutes ago
0
[router] Replace print with logger

#2183 ByronHsu closed 32 minutes ago
0
Bump rustls from 0.23.16 to 0.23.18 in /rust

#2182 dependabot[bot] closed 2 hours ago
0
[Bug] Qwen2-VL-7B IndexError

#2181 jakep-allenai opened 5 hours ago
0
[CI] Split test cases in CI for better load balancing

#2180 merrymercy closed 9 hours ago
0
feat: add should_use_tensor_cores

#2179 zhyncs opened 10 hours ago
1
[Feature] Get the real logprobs to analysis decoding

#2178 Snowdar opened 12 hours ago
1
[Bug] frequency penalty

#2177 vivian0429 opened 12 hours ago
1
Update XGrammar to the latest API

#2176 Ubospica opened 12 hours ago
5
[Fix] Avoid calling fill_vocab_mask for terminated requests

#2175 Ubospica closed 12 hours ago
0
feat: fused_moe fp8 monkey patch

#2174 zhyncs closed 13 hours ago
2
[feat] Refactor session control interface and add CI

#2173 Ying1123 closed 1 hour ago
0
Question about ragged wrapper

#2172 ZhongYingMatrix closed 13 hours ago
2
[Performance]: Process affinity to CPU cores with multiple sockets support

#2171 HaiShaw opened 15 hours ago
1
Replace prob based with threshold based load balancing

#2170 ByronHsu closed 14 hours ago
2
Allow overwrite flashinfer use_tensorcore

#2169 merrymercy closed 17 hours ago
0
[Feature] How to accelerate constrained decoding when regex needs to change with input?

#2168 GrittyChen opened 18 hours ago
0
[Fused moe] add tuning fused configs for qwen2 57b and mixtral 8x7b

#2167 BBuf closed 19 hours ago
3
[Bug] cannot import name 'CachedGrammarCompiler' from 'xgrammar' (version 0.3.6)

#2166 Quang-elec44 opened 21 hours ago
0
test select concurrency

#2165 qeternity opened 1 day ago
5
Fix docs

#2164 merrymercy closed 1 day ago
0
Rename triton_fused_moe -> fused_moe_triton

#2163 merrymercy closed 1 day ago
1
Balance CI tests

#2162 merrymercy closed 1 day ago
0
fix: use torch.sum for compatible

#2161 zhyncs closed 1 day ago
0
[Bug] FusedMoE compatible with vllm 0.6.3.post1

#2160 zhyncs closed 1 day ago
0
Update CI threshold & Improve code style

#2159 merrymercy closed 1 day ago
0
Fix mixed chunked prefill in overlap mode

#2158 merrymercy closed 1 day ago
0
fix: resolve end-of-file-fixer

#2157 zhyncs closed 1 day ago
0
feat: update other MoE models deps

#2156 zhyncs closed 1 day ago
5
feat: update gitignore and add tuning config for FusedMoE

#2155 zhyncs closed 1 day ago
0
Simplify `Scheduler.update_running_batch`

#2154 merrymercy closed 1 day ago
0
feat: remove the dependency on FusedMoE

#2153 zhyncs closed 1 day ago
2
Merged three native APIs into one: get_server_info

#2152 henryhmko closed 1 day ago
2
[Bug] llava use image hash as token，leading to cache bug

#2151 zwc163 opened 1 day ago
1
Speculative EAGLE2. New PR

#2150 yukavio opened 1 day ago
0
Byhsu/fairness router

#2149 ByronHsu opened 1 day ago
0
Improve sglang router

#2148 ByronHsu closed 1 day ago
0
add prefix match for certain tenant

#2147 ByronHsu closed 1 day ago
0
Add more api routes (completion, health, etc) to the router

#2146 ByronHsu closed 1 day ago
0
[Draft] Resolving integration differences after XGrammar lauch refactoring

#2145 gittb closed 6 hours ago
5
fix dp_rank env

#2144 ByronHsu closed 2 days ago
0
update router doc

#2143 ByronHsu closed 2 days ago
0
Bump sglang-router to 0.0.5

#2142 ByronHsu closed 2 days ago
2
[Bug] Error when using LLAVA 1.5 for llava bench

#2140 pspdada closed 1 day ago
1
fix: resolve bench_serving args

#2139 zhyncs closed 2 days ago
1
Fix dp print message

#2138 merrymercy closed 2 days ago
0
[CI] Fix test cases

#2137 merrymercy closed 2 days ago
0
Add concurrency option for benchmark

#2136 cermeng closed 2 days ago
1
Add concurrency option in benchmark

#2135 cermeng closed 2 days ago
0
Fix grid size in Triton decoding kernel

#2134 ispobock closed 2 days ago
1