Open oulgen opened 11 months ago
cc: @jansel @Chillee since we chatted about this
Hi @davidberard98 is there any updates on this issue? Has this been resolved or do we want to take action on it? Thanks.
cc @oulgen what's the priority on this? I don't have concrete plans to work on this in the short term, but now with caching being enabled is it something that is higher priority now?
@davidberard98 @zhxchen17 This is still good to fix but I dont think it is actively blocking anything
🐛 Describe the bug
When inductor generates a kernel, it emits inside the async_compile.triton(...). The code inside this block is cached across different graphs. However, a recent change introduced
'kernel_name'
field inside the meta in order to identify kernels. This field makes it so that the caching can no longer work as each blob is now unique. Example:This does not affect caching for kernels in same graph as inductor has another layer of caching.
Two possible solutions @davidberard98 and I discussed are 1) emitting two wrappers in codegen one for profiling and one for regular execution where the profiling one records the name and 2) emitting a wrapper around cached async compile.
Versions
origin/main
cc @ezyang @anijain2305 @chauhang @penguinwu @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @msaroufim @wconstab @bdhirsh @zou3519 @yf225 @aakhundov