Closed viv-eth closed 8 months ago
This PR adds the FP32 LayerNorm kernel utilizing SSRs and FREP to improve performance.
This PR adds the FP32 LayerNorm kernel utilizing SSRs and FREP to improve performance.