Open cm2435 opened 11 months ago
For reference, here are my system settings:
OS: Ubuntu 22.04 | NVCC: 12.3 | NVIDIA-SMI: 525.85.05 | Driver Version: 525.85.05
The underlying card is an RTX 3090.
Tutorials on this would be great if anyone has one. I tried to follow the implementation by @pommedeterresautee in the kernl package: I wrapped the kernel call in a torch.autograd.Function class and then wrapped that class's .apply in a plain function, but the compiler still complains that the kernel launch is not symbolically traceable.
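For context, the wrapping I mean looks roughly like the sketch below. This is not the kernl code: `AddFunction` and `add` are hypothetical names, and `torch.add` stands in for the Triton launcher so the snippet runs without a GPU.

```python
import torch


class AddFunction(torch.autograd.Function):
    """Hypothetical autograd wrapper around an elementwise add."""

    @staticmethod
    def forward(ctx, x, y):
        # Assumption: a Triton launcher would be called here.
        # torch.add is only a CPU stand-in for it.
        return torch.add(x, y)

    @staticmethod
    def backward(ctx, grad_out):
        # d(x + y)/dx = d(x + y)/dy = 1, so the gradient passes through unchanged.
        return grad_out, grad_out


def add(x, y):
    # Plain-function wrapper around .apply, as described above.
    return AddFunction.apply(x, y)
```

This gives the op a backward pass, but on its own it does not stop the tracer from stepping into the launcher.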
Hello! Sorry for the slightly crossposty issue but this seems like the most likely place I could post this and get some help.
I'm trying a basic 'hello world': substituting operator patterns in a symbolic trace of a PyTorch module, to get to grips with integrating the Triton kernels I write with native PyTorch code.
To replicate this, I took the tutorial-1 code on vector addition:
And wrote the most basic PyTorch module I could come up with:
Following the PyTorch documentation on operator pattern substitution, it seems to trace fine (as it should):
But the code
fails with the error
I understand this has to do with the PyTorch compiler being unable to symbolically trace code with dynamic control flow. The specific point of failure is the kernel launch for 'add_kernel'. I just wanted to see if anyone has advice on how to do this properly and what I'm missing.
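One direction that may be relevant is `torch.fx.wrap`, which registers a function as a leaf so the tracer records a single call node instead of stepping into its body. The sketch below is a minimal, CPU-only illustration under assumptions: `launch_add`, `AddModule`, `pattern` and `replacement` are hypothetical names, and `launch_add` is a stand-in for a Triton launcher (it only branches on a tensor property, which is enough to break plain tracing). I have not verified it against the real 'add_kernel' launch.

```python
import torch
import torch.fx
from torch.fx import symbolic_trace, subgraph_rewriter


def launch_add(x, y):
    # Hypothetical stand-in for a Triton launcher. The branch below is
    # data-dependent control flow, which symbolic tracing cannot follow.
    if x.numel() != y.numel():
        raise ValueError("inputs must have the same number of elements")
    return torch.add(x, y)


# Register the launcher as a leaf function. torch.fx.wrap has to be called
# at the top level of a module, and it takes the function's global name.
torch.fx.wrap("launch_add")


class AddModule(torch.nn.Module):
    def forward(self, x, y):
        return torch.add(x, y)


def pattern(x, y):
    # Subgraph to look for in the traced module.
    return torch.add(x, y)


def replacement(x, y):
    # Traced as one opaque call_function node because launch_add is wrapped.
    return launch_add(x, y)


traced = symbolic_trace(AddModule())
matches = subgraph_rewriter.replace_pattern(traced, pattern, replacement)
```

After the rewrite, `traced.graph` should contain a `call_function` node targeting `launch_add` in place of `torch.add`, and calling `traced(x, y)` runs the launcher eagerly with real tensors.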
Much thanks.