-
### Description
`cupyx.scipy.signal.upfirdn` fails with a `CUDADriverError` for some combinations of parameters when the input signal is 2D. It seems to depend on both the types and the up/down facto…
-
### 🐛 Describe the bug
The process works correctly with DDP world size 1, but with world size > 1 it hangs, with GPU 0 at 0% and GPU 1 pinned at maximum occupancy. I've replicated this bot…
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N…
```
-
Hi, I have been trying to use this project for the last few days, unsuccessfully.
Installed all dependencies on Ubuntu 22.04, as instructed, and the build was successful.
Using NVCC 11.5. Older ver…
-
this is what I get
meriem@Home:~/llama-gpt$ ./run.sh --model 7b --with-cuda
[+] Building 4.2s (30/30) FINISHED …
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch…
```
-
Hi
I want to use NVRTC to compile an sm90 kernel at runtime. The problem is that I don't have the kernel instance on the host, so I can't run `to_underlying_arguments` to get the kernel params to launch the kerne…
-
Hello, IREE developers, I've seen a similar question asked before: https://github.com/iree-org/iree/issues/11573#issuecomment-1371128261, but it seems to lack further discussion.
Suppose we have a c…
-
### Your current environment
```
Collecting environment information...
PyTorch version: 2.6.0.dev20241008+cu124
Is debug build: False
CUDA used to build PyTorch:…
```
-
### Your current environment
Collecting environment information...
PyTorch version: 2.3.0
Is debug build: False
CUDA used to build PyTorch: 12.2
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.0…