Hi all,
Firstly, I would like to thank the authors for giving such excellent works. I have some questions about the training that starts from checkpoint. Below is my validation xent curve, and different color indicates the boundary for continuous training.
The setting is identical in the training process, but the performance drops suddenly at the beginning of continuous training. I also check the training xent curve and the learning rate curve to make sure the optimizer is also loaded correctly:
As you may see, the training seems to be normal. Thus, I would like to search for the help. Does anyone suffer this issue before? Any comment would be very appreciated.
Hi all, Firstly, I would like to thank the authors for giving such excellent works. I have some questions about the training that starts from checkpoint. Below is my validation xent curve, and different color indicates the boundary for continuous training.
The setting is identical in the training process, but the performance drops suddenly at the beginning of continuous training. I also check the training xent curve and the learning rate curve to make sure the optimizer is also loaded correctly:
As you may see, the training seems to be normal. Thus, I would like to search for the help. Does anyone suffer this issue before? Any comment would be very appreciated.