ROCm / AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
11 stars 7 forks source link

fix fx2ait bug #39

Closed Yanxing-Shi closed 1 year ago

Yanxing-Shi commented 1 year ago

https://github.com/ROCmSoftwarePlatform/AITemplate/pull/37 previously affected fx2ait, now fix.