-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
Open issue to openly discuss potential ideas or improvements, whether on documentation, interfaces, examples, bug fixes, etc.
-
Hi all,
I am working on a kernel which hits an assertion in `RemoveLayoutConversions` pass during the IR rewrite (the latest `main` branch). The bug is common for both `cuda` and `hip` backends.
…
-
**_This is an issue currently facing by many users of Rodeo, so please dont close this_**
I have been a user of rodeo for an year or more. Eventhough rodeo has many bugs and problems I stayed with…
ghost updated
3 years ago
-
i start the job the i met this error:
cuda: 12.0
torch: 1.14
deepspeed --num_gpus 2 pretrain_gpt_v2.py --tensor-model-parallel-size 1 --pipeline-model-parallel-size 1 --distributed-backend nccl -…
-
forge freeze on "AttributeError: 'NoneType' object has no attribute '_id'" in any version after 10b5ca25 commit, i was have rocm 5.7 and i got this problem then updated to rocm 6.1 and still the same …
-
Hi all,
I have a fulll AMD machine with an AMD RX 6700 XT gpu.
I tried to install in Debian Testing these packages from the Debian repositories:
```
mirto@mostro:~$ sudo nala show hipcc |grep V…
-
```
===> Testing for spheral-2023.03.0
===> spheral-2023.03.0 depends on file: /usr/local/bin/python3.9 - found
cd /usr/ports/science/spheral/work/.build && /usr/bin/env F77="gfortran12" F90="g…
-
Hi, I'm new to Kokkos and playing with Kokkos with CUDA backend.
I encountered this memory access error when running one of the tutorials "exercise_01_solution". Please let me know if I'm doing anyt…
-
### 🐛 Describe the bug
Hi,
I'm trying to make my [MoE Triton kernel](https://github.com/RobertCsordas/moe_layer/blob/master/triton_src/moe_layer/cvmm.py) work with torch.compile(). I know that thi…