Closed diggerdu closed 2 years ago
Hi, thanks for this great bug report. I've not tested this code in evaluation mode.
This bug happens because `torch.*` functions call `__torch_function__` when evaluating a `LazyTensor`, and because `LazyTensor` is designed to evaluate eagerly in evaluation mode, this causes a recursion error.
I don't think the fix in #4 is ideal, however, because the code should evaluate eagerly in evaluation mode (where OOM almost never happens because the graph does not need to be saved). So the 'correct' fix would be to run every `arg` and `kwarg` before passing them to `torch.*` functions.
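To make the idea concrete, here is a rough, torch-free sketch of that pattern. `Lazy`, `materialize`, and `call_eagerly` are hypothetical names made up for illustration, not the library's actual API; `call_eagerly` only plays the role that `__torch_function__` would play in the real code.

```python
from typing import Any, Callable


class Lazy:
    """Hypothetical stand-in for LazyTensor: wraps a deferred computation."""

    def __init__(self, thunk: Callable[[], Any]) -> None:
        self._thunk = thunk

    def run(self) -> Any:
        # Force the deferred computation and return the concrete value.
        return self._thunk()


def materialize(value: Any) -> Any:
    # Unwrap Lazy values; pass everything else through unchanged.
    return value.run() if isinstance(value, Lazy) else value


def call_eagerly(func: Callable[..., Any], *args: Any, **kwargs: Any) -> Any:
    # Run every arg and kwarg first, so func only ever sees concrete values
    # and has no reason to dispatch back into the lazy wrapper.
    real_args = [materialize(a) for a in args]
    real_kwargs = {k: materialize(v) for k, v in kwargs.items()}
    return func(*real_args, **real_kwargs)


def scaled_sum(x: int, y: int, scale: int = 1) -> int:
    # Stand-in for a torch.* function that expects concrete inputs.
    return (x + y) * scale


result = call_eagerly(scaled_sum, Lazy(lambda: 2), 3, scale=Lazy(lambda: 10))
```

Because the wrapped function never receives a `Lazy` value, nothing it does can re-enter the lazy dispatch path, which is what should break the recursion.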
I'll fix it when I can, but for now I'm going to close #4.