Hi, Thanks for your nice work!
I am using Colab to train ArcFace-Torch on the Asian-Celeb dataset. After Colab disconnects, I want to continue training from where it left off, so I set config.resume = True, and the latest backbone.pth file loads successfully. However, training starts over from epoch 0, and the accuracy and loss look the same as when I first trained from scratch, or even worse. Did I actually resume training? What should I do to fix it? Thanks in advance.
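
For reference, here is a minimal sketch in plain PyTorch of the state I would expect a full resume to restore. It assumes that resuming needs more than the backbone weights: also the classification head, the optimizer, the learning-rate scheduler, and the epoch counter. All names here (`save_checkpoint`, `load_checkpoint`, `demo_resume`, the toy `head` layer) are hypothetical and are not ArcFace-Torch's actual API. Whether config.resume restores anything beyond backbone.pth is exactly what I am unsure about.

```python
import os
import tempfile

import torch
from torch import nn


def save_checkpoint(path, backbone, head, optimizer, scheduler, epoch, global_step):
    """Save everything needed to continue training, not only the backbone weights."""
    torch.save(
        {
            "backbone": backbone.state_dict(),
            "head": head.state_dict(),            # classification/margin layer (assumed)
            "optimizer": optimizer.state_dict(),  # momentum buffers, current lr
            "scheduler": scheduler.state_dict(),  # position in the lr schedule
            "epoch": epoch,
            "global_step": global_step,
        },
        path,
    )


def load_checkpoint(path, backbone, head, optimizer, scheduler):
    """Restore the training state and return (start_epoch, global_step)."""
    state = torch.load(path, map_location="cpu")
    backbone.load_state_dict(state["backbone"])
    head.load_state_dict(state["head"])
    optimizer.load_state_dict(state["optimizer"])
    scheduler.load_state_dict(state["scheduler"])
    # The saved epoch is finished, so training continues with the next one.
    return state["epoch"] + 1, state["global_step"]


def build():
    """Build toy stand-ins for the real backbone, head, optimizer, and scheduler."""
    backbone = nn.Linear(4, 2)
    head = nn.Linear(2, 3)
    params = list(backbone.parameters()) + list(head.parameters())
    optimizer = torch.optim.SGD(params, lr=0.1, momentum=0.9)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=1, gamma=0.5)
    return backbone, head, optimizer, scheduler


def demo_resume():
    """Train one step, save, rebuild from scratch, and resume from the file."""
    backbone, head, optimizer, scheduler = build()
    loss = head(backbone(torch.randn(8, 4))).sum()
    loss.backward()
    optimizer.step()
    scheduler.step()  # end of epoch 0: lr goes from 0.1 to 0.05

    path = os.path.join(tempfile.mkdtemp(), "checkpoint.pth")
    save_checkpoint(path, backbone, head, optimizer, scheduler, epoch=0, global_step=1)

    # Simulate a Colab restart: new objects, then restore from the checkpoint.
    backbone, head, optimizer, scheduler = build()
    start_epoch, global_step = load_checkpoint(path, backbone, head, optimizer, scheduler)
    return start_epoch, global_step, optimizer.param_groups[0]["lr"]


if __name__ == "__main__":
    print(demo_resume())
```

With a checkpoint like this, the training loop would start at `start_epoch` instead of 0. If only the backbone is reloaded, a freshly initialized head and optimizer could plausibly explain loss and accuracy that look like training from scratch, though I have not verified that this is the cause here.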