-
### Your current environment
The output of `python collect_env.py`:
```
Collecting environment information...
PyTorch version: 2.6.0.dev20241008+cu124
Is debug build: False
CUDA used to build PyTorch:…
```
-
```
model = LLM(model=model_name, max_model_len=4096, trust_remote_code=True, gpu_memory_utilization=0.6, tensor_parallel_size=2)
  File "/lib/python3.10/site-packages/vllm/executor/multiproc…
```
-
### Anything you want to discuss about vllm.
flashinfer already supports sliding window attention (https://github.com/flashinfer-ai/flashinfer/issues/159), and we should update our code to pass the sliding wind…
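As a rough illustration of what passing the window through could look like, here is a hedged sketch; it assumes a flashinfer build whose `single_prefill_with_kv_cache` accepts a `window_left` argument (-1 meaning no window), which may differ in your installed version:

```python
# Hedged sketch: assumes single_prefill_with_kv_cache accepts a
# window_left argument (-1 = no sliding window) in your flashinfer build.
import torch
import flashinfer

seq_len, num_qo_heads, num_kv_heads, head_dim = 2048, 32, 8, 128
q = torch.randn(seq_len, num_qo_heads, head_dim, dtype=torch.float16, device="cuda")
k = torch.randn(seq_len, num_kv_heads, head_dim, dtype=torch.float16, device="cuda")
v = torch.randn(seq_len, num_kv_heads, head_dim, dtype=torch.float16, device="cuda")

# Each query attends only to the previous 1024 keys (sliding window).
out = flashinfer.single_prefill_with_kv_cache(q, k, v, causal=True, window_left=1024)
```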
-
Hi,
I have fine-tuned Qwen2-VL using LLaMA-Factory.
I successfully quantized the fine-tuned model as shown below:
```
from transformers import Qwen2VLProcessor
from auto_gptq import BaseQuantizeC…
```
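For reference, a complete sketch of a generic auto_gptq quantization flow; the model path and calibration text are placeholders, and Qwen2-VL's multimodal inputs may need handling beyond this text-only example:

```python
# Hedged sketch of a generic auto_gptq flow; the model path and
# calibration text are placeholders, and Qwen2-VL's vision inputs
# are not covered by this text-only example.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

model_id = "path/to/finetuned-model"  # placeholder

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
quantize_config = BaseQuantizeConfig(bits=4, group_size=128, desc_act=False)

model = AutoGPTQForCausalLM.from_pretrained(
    model_id, quantize_config, trust_remote_code=True
)

# Calibration examples: tokenized text samples the quantizer runs through.
examples = [tokenizer("Example calibration sentence.", return_tensors="pt")]
model.quantize(examples)
model.save_quantized("path/to/quantized-model")
```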
-
### Describe the issue as clearly as possible:
When using vllm and outlines and running the server from a VM, the diskcache functionality does not seem to work correctly. Every time the server is …
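One possible workaround sketch: redirect outlines' on-disk cache to a path that is writable inside the VM; this assumes the installed outlines version reads the `OUTLINES_CACHE_DIR` environment variable:

```python
# Workaround sketch: point outlines' diskcache at a writable location.
# Assumes the installed outlines version honors OUTLINES_CACHE_DIR.
import os

os.environ["OUTLINES_CACHE_DIR"] = "/tmp/outlines_cache"  # placeholder path

import outlines  # import after setting the env var so the cache picks it up
```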
-
### 🚀 The feature, motivation and pitch
Currently we host vLLM wheels on AWS and ask users to install them via a long link:
`pip install https://vllm-wheels.s3.us-west-2.amazonaws.com/nightly/v…
-
### Your current environment
```
Name: vllm
Version: 0.6.3.post2.dev171+g890ca360
```
### Model Input Dumps
_No response_
### 🐛 Describe the bug
I used the interface from this vllm repository …
-
```
CUDA_VISIBLE_DEVICES=0,1 lm_eval --model vllm \
  --model_args pretrained=/home/jovyan/data-vol-1/models/meta-llama__Llama3.1-70B-Instruct,tensor_parallel_size=2,dtype=auto,gpu_memory_utilization=…
```
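The same evaluation can be driven from Python; here is a hedged sketch using lm-eval's `simple_evaluate` API (the task name and the truncated `gpu_memory_utilization` value are placeholders, and the exact signature can vary across lm-eval versions):

```python
# Hedged sketch of the equivalent lm-eval Python API call; the task and
# gpu_memory_utilization values are placeholders, not from the report.
import lm_eval

results = lm_eval.simple_evaluate(
    model="vllm",
    model_args=(
        "pretrained=/home/jovyan/data-vol-1/models/meta-llama__Llama3.1-70B-Instruct,"
        "tensor_parallel_size=2,dtype=auto,gpu_memory_utilization=0.8"
    ),
    tasks=["gsm8k"],  # placeholder task
)
print(results["results"])
```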
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### Model Input Dumps
_No response_
### 🐛 Describe the bug
…
-
This issue describes the high-level directions for creating "LLM Engine V2". We want the design to be as transparent as possible, and we created this issue to track progress and solicit feedback.
Goal…