Summary:
Add asymmetric shapes to get_input_iter() in order to test accuracy and performance of sum Triton kernel implementations against PyTorch.
This diff generates tensors with dimensions of different sizes. For example, a 2D asymmetric tensor would have shape (n, n + 3); a 3D asymmetric tensor would have shape (n, n + 3, n + 6).
Summary: Add asymmetric shapes to
get_input_iter()
in order to test accuracy and performance ofsum
Triton kernel implementations against PyTorch.This diff generates tensors with dimensions of different sizes. For example, a 2D asymmetric tensor would have shape
(n, n + 3)
; a 3D asymmetric tensor would have shape(n, n + 3, n + 6)
.Reviewed By: jbschlosser
Differential Revision: D58509022