ROCm / triton

Development repository for the Triton language and compiler
MIT License

Optimized stream-k kernel for AMD GPUs #415

Closed (zhanglx13 closed this 5 months ago)

zhanglx13 commented 8 months ago

Closing for now to save some CI runs. May reopen in the future.

zhanglx13 commented 5 months ago

@xiaohuguo2023 Can we wrap up and merge this PR? We can call this one version 0. If you do need to keep all the kernels, maybe you can create a subdir under perf-kernels, like perf-kernels/streamk/.

zhanglx13 commented 5 months ago

Sorry, I didn't mean to close it.

xiaohuguo2023 commented 5 months ago

@zhanglx13, I think it's ready to merge. Thanks!