NVIDIA / Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
Other
271 stars 53 forks source link

split inner reduction heuristics into 2d and 3d heuristics #3330

Closed liqiangxl closed 1 week ago

liqiangxl commented 2 weeks ago

What's in this PR? This PR separates inner reduction heuristics into distinct 2D and 3D heuristic functions.

Why? The 2D and 3D reductions represent different domain structures within the reduction tensor view:

These two configurations require different parallelization strategies, so keeping them in separate functions enhances maintainability and allows for individual optimization of each heuristic.

code changes The existing innerReductionHeuristic() is duplicated as inner2dReductionHeuristic and inner3dReductionHeuristic, will clean inner2dReductionHeuristic in a separate PR.

liqiangxl commented 2 weeks ago

!build

liqiangxl commented 2 weeks ago

!build

liqiangxl commented 1 week ago

!build