Closed by taddyb 1 year ago
The code runs, but the output doesn't contain any runoff. I'll have to dig into which time periods we can actually train on.
The in-place operation error was caused by the error_check() function. Torch believed that checking whether the result was NaN modified the tensor in place, so it threw an error.
I deleted the function, and now DDP is working.
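For anyone who wants the NaN check back later, here is a minimal sketch of a check that only reads the tensor. The original error_check() body isn't shown in this thread, so `has_nan` and `inplace_breaks_backward` are hypothetical names and the in-place call below is only an assumed example of the kind of operation autograd rejects, not the removed code.

```python
import torch


def has_nan(result: torch.Tensor) -> bool:
    """Return True if any element of `result` is NaN, without modifying it."""
    # torch.isnan returns a new boolean tensor; detach() keeps the check
    # out of the autograd graph. Nothing is written back into `result`.
    return bool(torch.isnan(result.detach()).any())


def inplace_breaks_backward() -> bool:
    """Show that an in-place edit of a tensor saved for backward raises."""
    x = torch.ones(3, requires_grad=True)
    y = x.exp()               # exp() saves its output for the backward pass
    y.nan_to_num_(nan=0.0)    # assumed example: in-place NaN replacement
    try:
        y.sum().backward()
    except RuntimeError:
        # autograd detected that a saved tensor was modified in place
        return True
    return False


def readonly_check_keeps_backward() -> bool:
    """Show that the read-only check leaves the backward pass intact."""
    x = torch.ones(3, requires_grad=True)
    y = x.exp()
    found_nan = has_nan(y)    # read-only, so y is unchanged
    y.sum().backward()
    return (not found_nan) and x.grad is not None
```

Calling `has_nan(result)` before the loss computation and raising or logging from there should be safe under both DDP and single-process runs, since it never writes to the tensor.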
Single process is working too. Merging this code to main.
What was done:
Tagged issues:
Steps to run the code:
Terminal:
Pycharm: