Closed xiaohuguo2023 closed 1 month ago
streamk v0.2:
- new streamk tuning script that reduces compile and profiling time
- reimplement the spin lock using load/store cache modifiers
- add a CI test for the streamk kernel
- support for streampipelineV2
Let's close this PR: it now differs too much from the new main_perf to be merged safely. I have created a new [PR](https://github.com/ROCm/triton/pull/652) for v0.2.