When we write SDDMM (sampled dense-dense matrix multiplication) as S .* (A * B), we don't currently get the benefits of fusion, because the pointwise multiplication happens after a reduction. We need to add a heuristic that performs backward fusion, at least in cases where the reduction has only one consumer.
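To make the cost difference concrete, here is a minimal pure-Python sketch of the two evaluation orders. It is not this project's implementation: the function names `sddmm_unfused` and `sddmm_fused`, the dict-of-coordinates representation of S, and the multiply-add counter are all assumptions chosen for illustration. The unfused version materializes the full reduction A * B before applying S; the fused version pushes the pointwise multiply back into the reduction, so the dot product is only computed where S has a stored entry.

```python
def sddmm_unfused(S, A, B):
    """Materialize the dense product A * B, then multiply pointwise by S."""
    m, k, n = len(A), len(B), len(B[0])
    madds = 0  # multiply-adds spent in the reduction (illustrative counter)
    C = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            for p in range(k):
                C[i][j] += A[i][p] * B[p][j]
                madds += 1
    # The pointwise multiply only runs after the whole reduction is done.
    out = {(i, j): s * C[i][j] for (i, j), s in S.items()}
    return out, madds


def sddmm_fused(S, A, B):
    """Fuse the pointwise multiply into the reduction: only visit nonzeros of S."""
    k = len(B)
    madds = 0
    out = {}
    for (i, j), s in S.items():
        acc = 0.0
        for p in range(k):
            acc += A[i][p] * B[p][j]
            madds += 1
        out[(i, j)] = s * acc
    return out, madds


# Hypothetical inputs: S is sparse (stored entries only), A is 3x2, B is 2x3.
S = {(0, 1): 2.0, (2, 0): -1.0}
A = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
B = [[1.0, 0.0, 2.0], [0.0, 1.0, 3.0]]

unfused, unfused_madds = sddmm_unfused(S, A, B)
fused, fused_madds = sddmm_fused(S, A, B)
```

Both versions produce the same result, but on these inputs the unfused order performs 18 multiply-adds in the reduction while the fused order performs 4, one dot product per stored entry of S. The rewrite is only obviously safe when the reduction has a single consumer, since otherwise the full A * B may still be needed elsewhere.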