ROCm / composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
https://rocm.docs.amd.com/projects/composable_kernel/en/latest/
Other
297 stars 113 forks source link

Merging the gfx12 code into public repo. #1362

Closed illsilin closed 3 months ago

illsilin commented 3 months ago

Enabling the gfx12 support.