-
Hi, thanks for the package! I looked through the source code in all branches, but I couldn't find any CUDA-related kernels, device/host functions, or CUDA includes/macros. From what I observed, the compu…
-
### Is there an existing issue?
- [X] I have searched the existing issues
### Experiencing problems? Have you tried our Discord first?
- [X] This is not a support question.
### Description of bug
…
-
---
## Feature request
I would like a nopython `@cfunc` to be able to launch a `@cuda.jit` kernel.
```python3
import numba.cuda  # binds `numba` and loads the CUDA submodule used below
import numba.types as nt
@numba.cuda.jit(nt.void(nt.CPointer(nt.float32), nt…
```
-
### Problem Description
Hi, I am getting an error when running the `tune_gemm.py` script.
I am inside a Docker container with access to 8 AMD MI300X GPUs, which are displayed when calling `rocm-smi`, and I ha…
-
### Required prerequisites
- [x] Consult the [security policy](https://github.com/NVIDIA/cuda-quantum/security/policy). If reporting a security vulnerability, do not report the bug using this form. U…
-
## Minimum reproducible example
`cargo build --release --features cuda`
## Error
error: failed to run custom build command for `mistralrs-quant v0.3.1 (C:\Users\misur\Desktop\rustsrc\mistral.rs.0.3…
-
CUDA programming, which is essential for ML/AI optimization, is highly sought after in the ML industry, especially now that we have entered the LLM era. In order to make the neural network training faster and more…
-
### Software environment
```Markdown
paddle2onnx 1.2.3
paddlefsl 1.1.0
paddlenlp 3.0.0b1
paddleocr 2.8.1
paddlepaddle 2.6.2
paddlepaddl…
```
-
Hi, I'm trying to build Darknet with CUDA support on Windows, but I'm getting an error. I have a clean Windows VM and have followed the steps in the README, including installing CUDA after Visual Studio. B…
-
# Christian Mills - CUDA MODE Lecture 1: How to profile CUDA kernels in PyTorch
Lecture #1 provides a practical introduction to integrating and profiling custom CUDA kernels within PyTorch programs, …