Open mbateman opened 1 year ago
Yesterday I encountered an error like that, a temporary solution can delete the model file that was created, and run again, the error will be gone
I also had this, but I actually would like to be able to load the last checkpoint so deleting the model is not a solution for me. Is there any way to resume training with a saved checkpoint?
Description
...
Environment information
For bugs: reproduction and error logs