-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
### Your current environment
```
(vllm-gptq) root@k8s-master01:/workspace/home/lich/QuIP-for-all# pip3 list | grep aphrodite
aphrodite-engine 0.5.3 /workspace/home/lich/aphrodite-eng…
-
### 🚀 The feature, motivation and pitch
Last week AMD announced ROCm 6.2 (https://rocm.docs.amd.com/en/latest/about/release-notes.html), which also announces expanded support for vLLM and FP8.
Actuall…
-
Why does a single inference take nearly 1 minute on 8×80G A100?
```python
import os
from vllm import LLM, SamplingParams
import torch
from constants_prompt import build_autoj_input
from zh_constants_prompt import zh_build_autoj_input
impo…
-
a series of text-conditioned Diffusion Transformers (DiT) capable of transforming textual descriptions into vivid images, dynamic videos, detailed multi-view 3D images, and synthesized speech.
Code…
-
When I run the Llama 3.1 example with RunPod I'm getting this error:
h/sky-key' root@69.30.85.136 -p 22035 -o StrictHostKeyChecking=no -o PasswordAuthentication=no -o ConnectTimeout=10s -o UserKn…
-
### 🚀 The feature, motivation and pitch
# Background
Currently, the project supports various hardware accelerators such as GPUs, but there is no support for NPUs. Adding NPU support could signific…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
The code I'm running:
```python
lm = dspy.HFClientVLLM(model="NurtureAI/Meta-Llama-3-8B-Instruct-32k", port=38242, url="http://localhost", max_tokens=4)
test_text = "This is a test article. abc"
output_n…
-
### Your current environment
PyTorch version: 2.1.2+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.4 LTS (x86_64)
GCC version: (U…