-
**Describe the bug**
When using Megatron-Core v0.9.0 with CUDA Graphs enabled, NaN gradients are encountered during the backward computation. This issue does not occur when CUDA Graphs are disabled.
…
-
It seems I am experiencing CUDA-Graph related issues when using "reduce-overhead" method for compilation. Am I right to assume that the reason for this is because CUDA graphs very much expect the same…
-
/home/shahao/anaconda3/envs/Unidepth/lib/python3.11/site-packages/timm/models/layers/__init__.py:48: FutureWarning: Importing from timm.models.layers is deprecated, please import via timm.layers
wa…
-
### Description
when i use `cmake .. -C ../cmake/caches/cn/cuda.cmake && make -j$(nproc)` install ,it show error:
```
In file included from /tmp/oneflow/oneflow/core/ep/cuda/cuda_stream.h:31,
…
-
## Description
I tried to convert onnx to an engine file using demo_img2vid. py on rtx4090,but received the following error message:
[E] [defaultAllocator.cpp::allocate::31] Error Code 1: Cuda Runti…
kolyh updated
3 hours ago
-
### Anything you want to discuss about vllm.
in qwen2vl's mrope imple, vllm decide whether input positions is for multimodal with
![image](https://github.com/user-attachments/assets/6dfc96d9-5162-…
-
Hi Xiaoxin,
I want to express my appreciation for the incredible work you and your team are doing.
But I encounted a problem: When I attempt to run the process on a single GPU, I get the follow…
-
**Description**
CUDA Graph not work in tensorrt backend. The model config as below:
```
platform: "tensorrt_plan"
version_policy: { latest: { num_versions: 2}}
parameters { key: "execution_mode"…
-
another way to save memory is to use LRUcache for this map, and capture it on demand.
_Originally posted by @scv119 in https://github.com/vllm-project/vllm/pull/1926#discussion_r1427594126_
…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related issue y…