issues
search
ROCm
/
rocBLAS
Next generation BLAS implementation for ROCm platform
https://rocm.docs.amd.com/projects/rocBLAS/en/latest/
Other
340
stars
157
forks
source link
re-optimize kernels when not large index
#1327
Closed
TorreZuk
closed
1 year ago
TorreZuk
commented
1 year ago
reduce computation when indices don't required int64