Closed Anndrey24 closed 1 month ago
This commit extends the SME conv2d NHWC schedule to support convolutions with float16 inputs (data and kernel) and a float32 output using the tensor intrinsics added in #16981.
cc @ekalda @lhutton1
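For readers unfamiliar with mixed-precision convolution, here is a minimal NumPy sketch of the numerics described above: float16 data and kernel, with products accumulated into a float32 output. It is not the TVM schedule or the SME tensor intrinsics from this PR. The function name `conv2d_nhwc_fp16_fp32`, the HWIO kernel layout, and the `stride`/`padding` parameters are assumptions made for illustration only.

```python
import numpy as np


def conv2d_nhwc_fp16_fp32(data, kernel, stride=1, padding=0):
    """Reference conv2d: NHWC float16 data, HWIO float16 kernel, float32 output.

    Hypothetical helper for illustration; not part of TVM.
    """
    assert data.dtype == np.float16 and kernel.dtype == np.float16
    n, h, w, c_in = data.shape
    kh, kw, k_c_in, c_out = kernel.shape
    assert c_in == k_c_in, "input channel counts must match"

    # Zero-pad the spatial dimensions only.
    padded = np.pad(
        data, ((0, 0), (padding, padding), (padding, padding), (0, 0))
    )
    out_h = (h + 2 * padding - kh) // stride + 1
    out_w = (w + 2 * padding - kw) // stride + 1

    # The accumulator is float32 even though both inputs are float16.
    out = np.zeros((n, out_h, out_w, c_out), dtype=np.float32)
    for i in range(kh):
        for j in range(kw):
            # Strided window of input pixels that meet kernel tap (i, j).
            window = padded[
                :,
                i : i + stride * out_h : stride,
                j : j + stride * out_w : stride,
                :,
            ]
            # Widen to float32 before multiplying, then contract over C_in.
            out += np.tensordot(
                window.astype(np.float32),
                kernel[i, j].astype(np.float32),
                axes=([3], [0]),
            )
    return out
```

A typical call would pass arrays created with `.astype(np.float16)`; the result has dtype `float32` and shape `(N, out_h, out_w, C_out)`. Widening before the multiply keeps rounding error from growing with the length of the reduction, which is the usual motivation for a float32 output.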
Thanks @Anndrey24 and @lhutton1!