-
>https://nvidia.github.io/cuda-python/cuda-bindings/latest/api.html
We have documentation for the Driver, Runtime and NVRTC APIs, but none of nvJitLink. It should be added.
-
**Describe the bug**
Any use of `shfl_sync` throws an error saying `shfl_recurse` is a dynamic function.
**To reproduce**
The Minimal Working Example (MWE) for this bug:
Attempting to do a stream…
-
Initializing a TMA descriptor through the driver APIs
https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__TENSOR__MEMORY.html
is really tedious and error prone. We need a way to abstract it out,…
-
**Describe the bug**
When running using CUDA after launching Julia with Nsight Systems, the program quits, but a profiling report is still generated.
**To reproduce**
The Minimal Working Example (…
-
Without the compiler option: https://github.com/mrakgr/Spiral-s-ML-Library/blob/c5d8a529b210f84dc955a017aeff455c2d27affd/game/leduc/fast_compile.py
With --Ofast-compile=max: https://github.com/mrakgr…
-
### Your current environment
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.4 LTS (x86_64)
GCC ve…
-
### 🐛 Describe the bug
The test case [`test_multi_device_cpu`](https://github.com/pytorch/pytorch/blob/1b95ca904f5020ad8649677cbef683fac9d8e768/test/inductor/test_aot_inductor.py#L304C1-L314C50) actu…
-
### Description
I used
```
python3 -m pip install --upgrade "jax[cuda12]"
```
to install JAX on a GPU node, but am getting a `CUDA_ERROR_SYSTEM_NOT_READY` error:
```
(base) $ python3 -c "import…
-
I want to install Apollo9.0 in a server, and the GPU is RTX4090, the version of cuda driver is 535.161.07, the version of cuda is 12.2.
When compiling the source code following the [tutorial](https…
-
I have built PhoenixOS in the container provided in the document, and then I want to test C/R of ResNet program in the same container, but there seems to be a problem with the API forwarding.
I fir…