issues
search
sgl-project
/
sglang
SGLang is a fast serving framework for large language models and vision language models.
https://sgl-project.github.io/
Apache License 2.0
6.22k
stars
531
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
rust e2e test
#2184
ByronHsu
opened
7 minutes ago
0
[router] Replace print with logger
#2183
ByronHsu
closed
32 minutes ago
0
Bump rustls from 0.23.16 to 0.23.18 in /rust
#2182
dependabot[bot]
closed
2 hours ago
0
[Bug] Qwen2-VL-7B IndexError
#2181
jakep-allenai
opened
5 hours ago
0
[CI] Split test cases in CI for better load balancing
#2180
merrymercy
closed
9 hours ago
0
feat: add should_use_tensor_cores
#2179
zhyncs
opened
10 hours ago
1
[Feature] Get the real logprobs to analysis decoding
#2178
Snowdar
opened
12 hours ago
1
[Bug] frequency penalty
#2177
vivian0429
opened
12 hours ago
1
Update XGrammar to the latest API
#2176
Ubospica
opened
12 hours ago
5
[Fix] Avoid calling fill_vocab_mask for terminated requests
#2175
Ubospica
closed
12 hours ago
0
feat: fused_moe fp8 monkey patch
#2174
zhyncs
closed
13 hours ago
2
[feat] Refactor session control interface and add CI
#2173
Ying1123
closed
1 hour ago
0
Question about ragged wrapper
#2172
ZhongYingMatrix
closed
13 hours ago
2
[Performance]: Process affinity to CPU cores with multiple sockets support
#2171
HaiShaw
opened
15 hours ago
1
Replace prob based with threshold based load balancing
#2170
ByronHsu
closed
14 hours ago
2
Allow overwrite flashinfer use_tensorcore
#2169
merrymercy
closed
17 hours ago
0
[Feature] How to accelerate constrained decoding when regex needs to change with input?
#2168
GrittyChen
opened
18 hours ago
0
[Fused moe] add tuning fused configs for qwen2 57b and mixtral 8x7b
#2167
BBuf
closed
19 hours ago
3
[Bug] cannot import name 'CachedGrammarCompiler' from 'xgrammar' (version 0.3.6)
#2166
Quang-elec44
opened
21 hours ago
0
test select concurrency
#2165
qeternity
opened
1 day ago
5
Fix docs
#2164
merrymercy
closed
1 day ago
0
Rename triton_fused_moe -> fused_moe_triton
#2163
merrymercy
closed
1 day ago
1
Balance CI tests
#2162
merrymercy
closed
1 day ago
0
fix: use torch.sum for compatible
#2161
zhyncs
closed
1 day ago
0
[Bug] FusedMoE compatible with vllm 0.6.3.post1
#2160
zhyncs
closed
1 day ago
0
Update CI threshold & Improve code style
#2159
merrymercy
closed
1 day ago
0
Fix mixed chunked prefill in overlap mode
#2158
merrymercy
closed
1 day ago
0
fix: resolve end-of-file-fixer
#2157
zhyncs
closed
1 day ago
0
feat: update other MoE models deps
#2156
zhyncs
closed
1 day ago
5
feat: update gitignore and add tuning config for FusedMoE
#2155
zhyncs
closed
1 day ago
0
Simplify `Scheduler.update_running_batch`
#2154
merrymercy
closed
1 day ago
0
feat: remove the dependency on FusedMoE
#2153
zhyncs
closed
1 day ago
2
Merged three native APIs into one: get_server_info
#2152
henryhmko
closed
1 day ago
2
[Bug] llava use image hash as token,leading to cache bug
#2151
zwc163
opened
1 day ago
1
Speculative EAGLE2. New PR
#2150
yukavio
opened
1 day ago
0
Byhsu/fairness router
#2149
ByronHsu
opened
1 day ago
0
Improve sglang router
#2148
ByronHsu
closed
1 day ago
0
add prefix match for certain tenant
#2147
ByronHsu
closed
1 day ago
0
Add more api routes (completion, health, etc) to the router
#2146
ByronHsu
closed
1 day ago
0
[Draft] Resolving integration differences after XGrammar lauch refactoring
#2145
gittb
closed
6 hours ago
5
fix dp_rank env
#2144
ByronHsu
closed
2 days ago
0
update router doc
#2143
ByronHsu
closed
2 days ago
0
Bump sglang-router to 0.0.5
#2142
ByronHsu
closed
2 days ago
2
[Bug] Error when using LLAVA 1.5 for llava bench
#2140
pspdada
closed
1 day ago
1
fix: resolve bench_serving args
#2139
zhyncs
closed
2 days ago
1
Fix dp print message
#2138
merrymercy
closed
2 days ago
0
[CI] Fix test cases
#2137
merrymercy
closed
2 days ago
0
Add concurrency option for benchmark
#2136
cermeng
closed
2 days ago
1
Add concurrency option in benchmark
#2135
cermeng
closed
2 days ago
0
Fix grid size in Triton decoding kernel
#2134
ispobock
closed
2 days ago
1
Next