Closed vinx13 closed 2 months ago
This is a great mechanism to integrate kernel generators. Some notes on design that might help generalize it abit
Would be great to change the intrinsic to T.call_kernel, which checks the first parameter for kernel types and dispatch accordingly.
call_kernel
leverages the registered mapping to do automatic conversion.
Added a macro
T.call_triton
in TIR script parser, which expands to AOT compilation of the kernel and the host TIR code to launch the kernel.cc @tqchen @cyx-6