NVIDIA / Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
Other
271 stars 53 forks source link

Fix the setting of `gdimx` in 2d and 3d inner reduction heuristics #3432

Closed rdspring1 closed 2 days ago

rdspring1 commented 3 days ago

This PR fixes the setting of gdimx in 2d and 3d inner reduction heuristics.

Previously, gdimx == LaunchParams::UNINITIALIZED_VAL in the else block, so the gdimx > scheduler_utils::x_grid_limit if statement was always false and gdimx was not set.

rdspring1 commented 3 days ago

!test