Closed LukasNel closed 1 year ago
Ah yes this looks like my notebook from a while back-- I think we want to train one at a time now, see here. I use this script now, which trains them separately (see the last few lines of the shell script)
duude, that's awesome thanks so much. You should definitely link that repo to this one.
haha, it's a bit rough around the edges because I keep changing it to try to make my training runs work properly. maybe if I get something working stably then we can talk to lucidrains :P
cool, it worked 😄
yea, like it said, you need to train each network one at a time (separate training script), then you can chain them altogether
Hey @lucidrains @LWprogramming, I am not able to wrap my head around the issue. I am trying to run on a single T4 gpu each notebook cell sequentially. I am still getting the Assertion Error.
Error
The code to check that seems to be fairly newly added.