Closed jagrit06 closed 2 weeks ago
Proposed changes
Fuses `gemm`, `addmm`, and `block_sparse_mm` into one uber-shader that uses Metal function constants for specialization. No performance regression has been seen on an M2 Ultra.

Checklist

Put an `x` in the boxes that apply.

- I have run `pre-commit run --all-files` to format my code / installed pre-commit prior to committing changes
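
For readers unfamiliar with the specialization idea under Proposed changes, here is a rough CPU-side analogy in C++. It is not the PR's Metal code. Template boolean parameters stand in for Metal function constants: one kernel body covers a plain matmul, a bias-added matmul, and a block-masked matmul, and each instantiation compiles away the branches it does not need. The names `fused_mm`, `HasBias`, and `HasMask`, the reading of addmm as `alpha * (A @ B) + beta * C`, and the mask semantics are all assumptions made for illustration.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical CPU analogy: the template flags play the role of Metal
// function constants, selecting a specialized variant of one shared body.
template <bool HasBias, bool HasMask>
std::vector<float> fused_mm(
    const std::vector<float>& a,  // M x K, row-major
    const std::vector<float>& b,  // K x N, row-major
    std::size_t M, std::size_t N, std::size_t K,
    const std::vector<float>* c = nullptr,    // M x N, read only if HasBias
    float alpha = 1.0f, float beta = 1.0f,
    const std::vector<bool>* mask = nullptr,  // one flag per output block, read only if HasMask
    std::size_t block = 1) {
  std::vector<float> out(M * N, 0.0f);
  const std::size_t blocks_per_row = (N + block - 1) / block;
  for (std::size_t i = 0; i < M; ++i) {
    for (std::size_t j = 0; j < N; ++j) {
      if constexpr (HasMask) {
        // Assumed semantics: a false flag leaves the whole output block at zero.
        if (!(*mask)[(i / block) * blocks_per_row + (j / block)]) continue;
      }
      float acc = 0.0f;
      for (std::size_t k = 0; k < K; ++k) acc += a[i * K + k] * b[k * N + j];
      if constexpr (HasBias) {
        // addmm-style epilogue, present only in the HasBias specialization.
        acc = alpha * acc + beta * (*c)[i * N + j];
      }
      out[i * N + j] = acc;
    }
  }
  return out;
}
```

Here `fused_mm<false, false>` behaves like a plain gemm, `fused_mm<true, false>` like addmm, and `fused_mm<false, true>` like a masked matmul. One difference from the analogy: Metal function constants are supplied when the pipeline is created rather than at C++ compile time, so a single compiled shader library can yield all variants.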