-
## Problem description
When launching with GPUs 2 and 3, the process hangs and never makes progress (the pairings 1,2 and 1,3 both work fine).
CUDA version: 12.1.0
Driver version: 535.54.03
torch: 2.1.2
fschat: 0.2.34
vllm: 0.2.6
ray: 2.8.1
## Launch command
```shell
CUDA_VISIBLE_DEVICES="2,3" python -…
```
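A hang that appears only for one particular GPU pairing usually points at the interconnect between those two devices. Below is a minimal diagnostic sketch, not taken from the report: the model path is hypothetical, and the point is simply to enable NCCL's own logging before the engine initializes so the stall point becomes visible.
```python
import os

# Must be set before vLLM/torch touch CUDA.
os.environ["CUDA_VISIBLE_DEVICES"] = "2,3"  # the pairing that hangs
os.environ["NCCL_DEBUG"] = "INFO"           # log NCCL init and topology detection

from vllm import LLM

# Hypothetical model path; tensor_parallel_size=2 shards across the two visible GPUs.
llm = LLM(model="/path/to/model", tensor_parallel_size=2)
```
If the NCCL log stalls during ring setup only for this pairing, comparing `nvidia-smi topo -m` output for the 2,3 pair against the working pairs is a reasonable next step.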
-
NVIDIA Jetson is aarch64, running Ubuntu 20.04 server (CUDA 12.2).
When running `pip install vllm`, the following error occurred:
× Getting requirements to build wheel did not run successfully.
│ exit code: …
-
### Motivation.
I am one of the authors of the paper Stay On Topic with Classifier-Free Guidance (https://openreview.net/forum?id=RiM3cl9MdK&noteId=s1BXLL1YZD), which has been nominated as an ICML'24 Spo…
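For readers unfamiliar with the paper: the mechanism is a per-step combination of two next-token logit streams, one conditioned on the guidance prompt and one not. A minimal sketch of that formula (names are illustrative, not a vLLM API):
```python
import torch

def cfg_logits(cond_logits: torch.Tensor,
               uncond_logits: torch.Tensor,
               guidance_scale: float) -> torch.Tensor:
    """Classifier-free guidance on next-token logits.

    guidance_scale == 1.0 recovers ordinary sampling; larger values
    push sampling toward the conditioned prompt.
    """
    return uncond_logits + guidance_scale * (cond_logits - uncond_logits)
```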
-
### Your current environment
Using the latest available Docker image: vllm/vllm-openai:v0.5.0.post1
### 🐛 Describe the bug
I am getting an "Internal Server Error" response when calling the /v1/embedd…
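A minimal reproduction sketch against the OpenAI-compatible server; the port and model name here are assumptions, not from the report:
```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# Per the report, this call comes back as a 500 "Internal Server Error".
resp = client.embeddings.create(
    model="intfloat/e5-mistral-7b-instruct",  # hypothetical served embedding model
    input="hello world",
)
print(resp.data[0].embedding[:5])
```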
-
I am facing difficulties specifying GPU usage for different models in an LLM inference pipeline with vLLM. Specifically, I have 4 RTX 4090 GPUs available, and I aim to run an LLM with a size of 42 GB …
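One common pattern, sketched here with hypothetical model paths: pin each engine to its own GPU subset via CUDA_VISIBLE_DEVICES before the engine is created, and size tensor parallelism to that subset. A 42 GB model needs two 24 GB RTX 4090s, which leaves the other two cards free for a second model in a separate process.
```python
import os

# Process A: the 42 GB model sharded across GPUs 0 and 1 (2 x 24 GB).
# Must be set before vLLM initializes CUDA.
os.environ["CUDA_VISIBLE_DEVICES"] = "0,1"

from vllm import LLM

big_llm = LLM(model="/path/to/42gb-model", tensor_parallel_size=2)

# Process B (run separately) would set CUDA_VISIBLE_DEVICES="2,3"
# and create its own LLM for the second model.
```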
-
### Your current environment
* vllm (commit `db2a6a41e206abecf4128aba25117fcaf7bebe12`) + ROCm 6.0 Docker image built with the [fix of Dockerfile.rocm](https://github.com/vllm-project/vllm/issues/386…
-
I plan to implement function calling with vision models such as LLaVA and Nous-Hermes-2-Vision-Alpha based on the image, but it seems that the current implementation in the example folder only sup…
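For reference, the request shape being asked about would combine an OpenAI-style `tools` field with an image content part, roughly as in this sketch; whether vLLM's example server accepts both together is exactly the open question, and the model name is an assumption:
```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

resp = client.chat.completions.create(
    model="llava-hf/llava-1.5-7b-hf",  # hypothetical served vision model
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What should be done with this object?"},
            {"type": "image_url", "image_url": {"url": "https://example.com/img.png"}},
        ],
    }],
    tools=[{
        "type": "function",
        "function": {
            "name": "pick_up_object",
            "description": "Pick up the object identified in the image.",
            "parameters": {
                "type": "object",
                "properties": {"label": {"type": "string"}},
            },
        },
    }],
)
print(resp.choices[0].message)
```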
-
### 🚀 The feature, motivation and pitch
The paper claims major improvements over vLLM. Unfortunately there is no code, only the paper:
arxiv.org/abs/2405.04437
### Alternatives
_No response_
### Additional context
…
-
Somehow `max_prompt_len` may be 0 in this code: https://github.com/vllm-project/vllm/blob/264017a2bf030f060ebad91eb9be9b4e0033edb9/vllm/worker/model_runner.py#L232
```
| File "/usr/local/lib…
```
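Not vLLM's actual code, but a minimal sketch of the failure mode and the usual defensive guard: if every sequence in the batch is empty (e.g. a decode-only batch slips into the prompt path), `max()` over the lengths yields 0 and the padded tensor has zero columns, which breaks downstream ops.
```python
import torch

def pad_prompt_batch(prompt_token_ids: list[list[int]], pad_id: int = 0) -> torch.Tensor:
    max_prompt_len = max((len(ids) for ids in prompt_token_ids), default=0)
    # Guard: clamp to at least 1 so downstream ops never see a 0-width tensor.
    max_prompt_len = max(max_prompt_len, 1)
    padded = [ids + [pad_id] * (max_prompt_len - len(ids)) for ids in prompt_token_ids]
    return torch.tensor(padded, dtype=torch.long)
```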
-
### Your current environment
Deploying Qwen1.5-14B-Chat with vllm==0.3.3 on a Tesla V100-PCIE-32GB produces nothing but exclamation marks; no usable output.
### 🐛 Describe the bug
Deploying Qwen1.5-14B-Chat with vllm==0.3.3 on a Tesla V100-PCIE-32GB produces nothing but exclamation marks; no usable output…
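A minimal reproduction sketch; the explicit `dtype="float16"` is an assumption (V100s lack bfloat16 support, so fp16 is the natural setting there and a common suspect when output degenerates into a single repeated token):
```python
from vllm import LLM, SamplingParams

# vllm==0.3.3 on a Tesla V100-PCIE-32GB, per the report.
llm = LLM(model="Qwen/Qwen1.5-14B-Chat", dtype="float16")
out = llm.generate(["你好,请介绍一下你自己。"], SamplingParams(max_tokens=64))
print(out[0].outputs[0].text)  # reported result: nothing but exclamation marks
```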