-
### 🐛 Describe the bug
https://github.com/microsoft/DeepSpeed/issues/6673
try install deepspeed . on torch 2.5.0-cuda
then
running build_ext
```error
D:\my\env\python3.10.10\lib\site-packages…
-
### 🐛 Describe the bug
We encountered an illegal memory access issue with `torch.compile` and customized torch library operator.
Here's one minimal example to reproduce:
```python
import torch…
-
This is the error log
```
0%| …
-
When I run the train.py file to train the Davis dataset, I set input_dim_drug in the config file to 212 as prompted by the author. But then a runtime error occurs:
RuntimeError: CUDA error: device-si…
-
### Your current environment
/usr/lib/python3.10/inspect.py:288: FutureWarning: `torch.distributed.reduce_op` is deprecated, please use `torch.distributed.ReduceOp` instead
return isinstance(objec…
-
I was looking for methods like `cudaGetDeviceProperties` from `include/cuda_runtime_api.h` but they don't seem to be present. Is there a reason for that, or is it just not done yet? I can look at addi…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTor…
-
### Describe the bug
When both CUDA and HIP devices are present in the system, switching between them causes a crash. Specifically, after a CUDA device is used, submitting operations to a HIP devic…
-
`linalg::Svd()` outputs NaN when the input tensor contains only zeros. This issue only happens on GPU and doesn't happen when the data type is float. This bug is the culprit of the broken `Svd.gpu_U1_…
-
### Describe the issue
we currently use large max_length in beam search, but we got max_length