AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
11
stars
7
forks
source link
https://github.com/aska-0096/navi3x_ck.git is not open source #70
https://github.com/aska-0096/navi3x_ck.git is not open source