-
Nice project!
I believe this project can greatly benefit from https://github.com/sgl-project/sglang. You can try to use SGLang as a backend for local models.
- The fast JSON decoding [feature](h…
-
### Proposal to improve performance
vLLM is underperforming compared with SGLang. Something needs optimization to improve performance.
### Report of performance regression
https…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related issue y…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related iss…
-
I've been investigating a performance issue with SGLang on RunPod's serverless platform. Here are my key findings:
I identified that SGLang performs significantly worse on the serverless setup comp…
-
Hi,
There seem to be some big changes, and I cannot find a single example showing how to load the Hugging Face models I was previously using with `HF.model`. Also, the dspy AI tool is broken and no…
-
Versions: evalscope 0.5.3
sglang 0.3.0
I launched a local SGLang OpenAI API server with the following command:
CUDA_VISIBLE_DEVICES=4,5,6,7 python -m sglang.launch_server --model-path /local/models/Qwen2-72B-Instruct --tp 4 …
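Since `launch_server` exposes an OpenAI-compatible API, the behavior can usually be reproduced with a plain chat-completions request. The sketch below only constructs the request payload; the base URL (SGLang's default port 30000) and the served model name are assumptions and may differ depending on your launch flags.

```python
import json

# Minimal sketch of a request against an SGLang OpenAI-compatible server.
# Assumptions: the server listens on the default port 30000 and serves the
# model under the name "Qwen2-72B-Instruct"; adjust both if your setup differs.
BASE_URL = "http://127.0.0.1:30000/v1/chat/completions"

payload = {
    "model": "Qwen2-72B-Instruct",
    "messages": [{"role": "user", "content": "Say hello."}],
    "temperature": 0.0,
    "max_tokens": 32,
}

body = json.dumps(payload)
print(body)
# POST `body` to BASE_URL with Content-Type: application/json
# (e.g. via requests.post) once the server is running.
```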
-
We would like to integrate the [cascade attention kernel](https://flashinfer.ai/2024/02/02/cascade-inference.html) from flashinfer.
Code pointers:
- Attention backend in sglang: https://github.com…
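For context on what the cascade kernel computes: cascade inference runs attention separately over the shared prefix and each request's unique suffix, then merges the two partial attention states using their log-sum-exp weights. A minimal NumPy sketch of that merge recurrence (not FlashInfer's actual API) might look like:

```python
import numpy as np

def attention_state(q, k, v):
    """Partial attention over one KV segment: returns (output, log-sum-exp)."""
    s = k @ q                        # scores for a single query vector, shape (n,)
    lse = np.log(np.sum(np.exp(s)))  # log-sum-exp of this segment's scores
    o = np.exp(s - lse) @ v          # softmax-weighted average of the values
    return o, lse

def merge_states(o1, lse1, o2, lse2):
    """Merge two partial attention states (the cascade merge operator)."""
    lse = np.logaddexp(lse1, lse2)
    return np.exp(lse1 - lse) * o1 + np.exp(lse2 - lse) * o2

# Splitting the KV cache into a "shared prefix" segment and a "unique suffix"
# segment and merging should match attention over the full sequence.
rng = np.random.default_rng(0)
q = rng.normal(size=4)
k, v = rng.normal(size=(6, 4)), rng.normal(size=(6, 4))

o_full, _ = attention_state(q, k, v)
o1, l1 = attention_state(q, k[:3], v[:3])   # prefix segment
o2, l2 = attention_state(q, k[3:], v[3:])   # suffix segment
o_merged = merge_states(o1, l1, o2, l2)
print(np.allclose(o_full, o_merged))  # True
```

This is why the kernel can reuse one pass over the shared prefix across a whole batch: only the cheap suffix passes and the merge are per-request.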
-
The [announcement blog post](https://llava-vl.github.io/blog/2024-04-30-llava-next-video/) indicates that inference can be done with sglang, but attempting to load the 7B model with the sglang backend:
…