ROCm / triton

Development repository for the Triton language and compiler
MIT License
92 stars 29 forks source link

Apply kernarg optimization to Triton #381

Closed wangye805 closed 7 months ago

wangye805 commented 1 year ago

Apply kernarg optimization to Triton

zhanglx13 commented 1 year ago

How does the performance (FA and GEMM) look like before and after this PR?