Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Apache License 2.0
1.15k
stars
77
forks
source link
Tensor.item should be disallowed in the CUDA Graphs region #727
File thunder.CUDAGraph0_fn_3:10, in CUDAGraph0_fn(***failed resolving arguments***)
9 del x
---> 10 f1 = Tensor.item(t0) # f1: "float ?"
11 del t0
RuntimeError: CUDA error: operation not permitted when stream is capturing
Note: If you have a model or program that is not supported yet but should be, please use the program coverage template.
🐛 Bug