cuda-graph Search Results

NVIDIA/Megatron-LM #1279

[BUG] Encountering NaN gradients when using CUDA Graph

**Describe the bug** When using Megatron-Core v0.9.0 with CUDA Graphs enabled, NaN gradients are encountered during the backward computation. This issue does not occur when CUDA Graphs are disabled. …

DXZDXZ updated 2 weeks ago

huggingface/parler-tts #154

CUDA Graph issues when using "reduce-overhead"

It seems I am experiencing CUDA-Graph related issues when using "reduce-overhead" method for compilation. Am I right to assume that the reason for this is because CUDA graphs very much expect the same…

medemi68 updated 1 month ago

lpiccinelli-eth/UniDepth #84

TypeError: autotune() got an unexpected keyword argument 'us…

/home/shahao/anaconda3/envs/Unidepth/lib/python3.11/site-packages/timm/models/layers/__init__.py:48: FutureWarning: Importing from timm.models.layers is deprecated, please import via timm.layers wa…

CharlotteHao updated 5 days ago

Oneflow-Inc/oneflow #10567

source install error

### Description when i use `cmake .. -C ../cmake/caches/cn/cuda.cmake && make -j$(nproc)` install ，it show error： ``` In file included from /tmp/oneflow/oneflow/core/ep/cuda/cuda_stream.h:31, …

TZHelloWorld updated 4 days ago

NVIDIA/TensorRT #4254

User allocator error allocating 86114304000-byte buffer fail…

## Description I tried to convert onnx to an engine file using demo_img2vid. py on rtx4090，but received the following error message： [E] [defaultAllocator.cpp::allocate::31] Error Code 1: Cuda Runti…

kolyh updated 1 hour ago

vllm-project/vllm #9546

[Usage]: Qwen2VL model mrope implemenation in cuda graph

### Anything you want to discuss about vllm. in qwen2vl's mrope imple, vllm decide whether input positions is for multimodal with ![image](https://github.com/user-attachments/assets/6dfc96d9-5162-…

gujiewen updated 1 month ago

XiaoxinHe/G-Retriever #30

CUDA error: invalid device ordinal (Finetuning with only on…

Hi Xiaoxin, I want to express my appreciation for the incredible work you and your team are doing. But I encounted a problem: When I attempt to run the process on a single GPU, I get the follow…

quqxui updated 4 days ago

triton-inference-server/server #7150

CUDA Graph not work

**Description** CUDA Graph not work in tensorrt backend. The model config as below: ``` platform: "tensorrt_plan" version_policy: { latest: { num_versions: 2}} parameters { key: "execution_mode"…

SunnyGhj updated 1 month ago

vllm-project/vllm #2143

Use LRU cache for CUDA Graphs

another way to save memory is to use LRUcache for this map, and capture it on demand. _Originally posted by @scv119 in https://github.com/vllm-project/vllm/pull/1926#discussion_r1427594126_ …

WoosukKwon updated 1 week ago

sgl-project/sglang #1558

[Bug] Exception: Capture cuda graph failed: Triton Error [CU…

### Checklist - [X] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version. - [ ] 3. Please note that if the bug-related issue y…

a136214808 updated 2 weeks ago

1000+ results for cuda-graph

1000+ results
for cuda-graph