-
### Describe the issue
```python
from vllm import LLM, SamplingParams
from minference import MInference
prompts = [
"Hello, my name is",
"The president of the United States is",
…
```
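For reference, a hedged sketch of how the truncated snippet likely continues, following MInference's documented vLLM patching flow; the model name and sampling settings here are placeholders, not taken from the original report:

```python
from vllm import LLM, SamplingParams
from minference import MInference

prompts = [
    "Hello, my name is",
    "The president of the United States is",
]
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

# Placeholder model id; any long-context model MInference supports should work.
model_name = "gradientai/Llama-3-8B-Instruct-262k"
llm = LLM(model_name, enforce_eager=True, max_model_len=128000)

# Patch the vLLM engine with MInference's sparse attention kernels.
minference_patch = MInference("vllm", model_name)
llm = minference_patch(llm)

outputs = llm.generate(prompts, sampling_params)
```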
-
Is fastchat no longer being updated to support new models?
-
I tried to run it on an H100, but there seems to be an illegal memory access inside the kernel.
```
RuntimeError: CUDA error: an illegal memory access was encountered
```
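Not from the original report, but a common first step for localizing an illegal memory access: CUDA reports these errors asynchronously, so forcing synchronous kernel launches makes the traceback point at the kernel that actually faulted. A minimal sketch:

```python
import os

# Must be set before CUDA is initialized, i.e. before importing torch/vllm:
# with synchronous launches, the error surfaces at the faulting kernel
# instead of at a later, unrelated CUDA call.
os.environ["CUDA_LAUNCH_BLOCKING"] = "1"

from vllm import LLM  # noqa: E402  (imported after the env var on purpose)
```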
-
Hi, do you support model inference through vLLM?
-
Thank you for your great work. I've run into an issue: image generation is too slow; generating a 512x512 image takes almost one minute. So I'd like to know whether Lumina-mGPT can support inference spe…
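A rough way to confirm where that minute goes, independent of the framework: time the generation call with CUDA events. This is a generic sketch; the Lumina-mGPT call itself is elided because its API isn't shown in the report:

```python
import torch

start = torch.cuda.Event(enable_timing=True)
end = torch.cuda.Event(enable_timing=True)

start.record()
# image = model.generate(...)  # Lumina-mGPT generation call goes here
end.record()
torch.cuda.synchronize()  # wait for all queued GPU work to finish

print(f"generation took {start.elapsed_time(end) / 1000:.1f}s")
```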
-
Is there a data table for the benchmark results?
-
Hello, in recent tests I benchmarked Llama-13b, 7b, and other models on an A100, comparing vllm and distserve. While meeting the SLO, distserve outperforms vllm. However, when testing codellama-34b with an input length of 8192, I found that TTFT is about 3x higher than vllm's. Is this expected? vllm uses tp2; distserve uses prefill tp2 and decode tp2.
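One way to sanity-check TTFT numbers independently of either system's built-in metrics: time the first streamed token over the OpenAI-compatible HTTP API. A rough sketch, assuming a server is already running at localhost:8000 and exposes that API; the prompt and model id below are placeholders:

```python
import time
from openai import OpenAI

# Assumes a vLLM (or compatible) OpenAI-style server at this address.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

prompt = "x " * 4096  # placeholder; the real test used 8192 input tokens

start = time.perf_counter()
stream = client.completions.create(
    model="codellama/CodeLlama-34b-hf",  # placeholder model id
    prompt=prompt,
    max_tokens=16,
    stream=True,
)
next(iter(stream))  # first streamed chunk ~= first generated token
print(f"TTFT: {time.perf_counter() - start:.3f}s")
```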
-
### The model to consider.
https://huggingface.co/openbmb/MiniCPM-V-2_6-int4
### The closest model vllm already supports.
_No response_
### What's your difficulty of supporting the model you want?…
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### 🐛 Describe the bug
(base) bob@test-ESC8000A-E11:~$ python…
-
### 🚀 The feature, motivation and pitch
Hi, I'm currently working on **deploying vLLM distributed across multiple nodes in a k8s cluster**. I saw that the official documentation provides a link for deploying with [LWS…