issues
search
ROCm
/
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
11
stars
7
forks
source link
[WIP] Gemm API for navi3
#38
Closed
Yanxing-Shi
closed
1 year ago