Closed embg closed 3 days ago
Do you ever run those tests while override ptxas?
Do you ever run those tests while override ptxas?
@ThomasRaoux we run these tests on our internal pin using an older CUDA toolchain. The issue was discovered during internal pin update to 3.2.x.
@peterbell10 Fixed all nits!
Do you ever run those tests while override ptxas?
@ThomasRaoux we run these tests on our internal pin using an older CUDA toolchain. The issue was discovered during internal pin update to 3.2.x.
I know this is very minor changes but overall I'm not a big fan of this direction as supporting all the ptxas versions is obviously impossible. Are you testing on older ptxas to catch potential problems related to ptxas or do you have patches downstream that you want to test. If you have a way I would disable those tests downstream instead.
@ThomasRaoux Spinning up a thread on Slack to discuss.
TMA fences require CUDA toolchain 12.3 or greater, but current gating does not check the CUDA toolchain version. This causes
test_experimental_tma.py
to fail when run with older CUDA toolchains.Before
With cuda-12.0:
With cuda-12.4:
After
With cuda-12.0:
With cuda-12.4: