Closed wangye805 closed 7 months ago
Apply kernarg optimization to Triton
How does the performance (FA and GEMM) look like before and after this PR?
Apply kernarg optimization to Triton