ROCm / composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
https://rocm.docs.amd.com/projects/composable_kernel/en/latest/
Other
251 stars 102 forks source link

Remove gfx900 and gfx906 from default target device to reduce package size #1351

Closed zjing14 closed 1 week ago