A retargetable MLIR-based machine learning compiler and runtime toolkit.
2.85k
stars
622
forks
source link
[GPU]: stack frame size (294916) exceeds limit (131056) in function 'torch_jit$async_dispatch_1_softmax_64x4x144x144xf32_dispatch_tensor_store #19180
Open
pdhirajkumarprasad opened 1 week ago
What happened?
For the given IR
getting error as
while it's working fine in CPU.
Steps to reproduce your issue
command:
version: IREE compiler version 3.0.0rc20241117 @ 29c451b00ecc9f9e5466e9d1079e0d69147da700
detail log:
dump.log
What component(s) does this issue relate to?
Compiler
Version information
No response
Additional context
No response