Closed dinhanhx closed 2 years ago
Hi! Can you fix the link for the AssertionError
? It does not point to an assertion
@carmocca Sorry I didn't make myself clear at that point. When the stack trace is printed, I see AssertionsError below the line I linked. I will send you the complete stack trace when I get back to my machine.
@carmocca I have updated the issue with traces.
Thanks! I guess this is caused by one of the recent mypy PRs we've merged. But the real assertion error does not get surfaced.
cc @kaushikb11 @awaelchli
@carmocca I downgraded lightning from 1.7.1 to 1.6.5. Then I rerun the boring models. It yields no errors.
We have a test for manual optimization silently failing with the same error here: #14034 I have not yet found a way to make the test failures surface in the CI.
Correction: I no longer see these failures from a few weeks ago. All tests are passing. @dinhanhx maybe it is worth trying master?
@awaelchli I just tried master 2022.08.22. Yes it works.
@kaushikb11 or @dinhanhx It would be great if you can git-bisect the commits between 1.7.0 and current master to find which commit fixed it so we include it in a bug-fix release.
@kaushikb11 @carmocca sorry I don't know how to use git bisect :(
Closing as there will be no more bug-fix releases before 1.8
🐛 Bug
As title, it only works with core. When multiple cores, it yields Assertion Errors under this line https://github.com/Lightning-AI/lightning/blob/acd4805f1a284e513272d150de6f98f27a0489b3/src/pytorch_lightning/loops/optimization/manual_loop.py#L110
To Reproduce
Expected behavior
It runs.
Environment
Additional context
I runs on Google TPU VM 3.8
cc @kaushikb11 @rohitgr7