Continuous Training Performance Drop

Hi all, First, I would like to thank the authors for this excellent work. I have a question about resuming training from a checkpoint. Below is my validation xent curve; each change of color marks a point where training was resumed from a checkpoint.

[Screenshot, 2020-04-15 9:28:33 PM: validation xent curve]

The settings are identical throughout the training process, but the validation performance drops suddenly at the beginning of each resumed run. I also checked the training xent curve and the learning rate curve to make sure the optimizer state was loaded correctly:

[Screenshots, 2020-04-15 9:28:24 PM and 9:28:40 PM: training xent curve and learning rate curve]
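The learning rate check can also be done numerically rather than by eye. Below is a minimal sketch, assuming a Noam-style schedule (linear warmup followed by inverse square root decay). The function names, the base learning rate, and the warmup length are hypothetical placeholders for illustration, not values taken from the PreSumm code or from my runs.

```python
def noam_lr(step, base_lr, warmup_steps):
    """Noam-style schedule (assumed): linear warmup, then inverse-sqrt decay."""
    if step < 1:
        raise ValueError("step must be >= 1")
    return base_lr * min(step ** -0.5, step * warmup_steps ** -1.5)


def lr_is_continuous(last_step, resumed_step, base_lr, warmup_steps, rel_tol=1e-3):
    """Return True if the learning rate at the first resumed step is within
    rel_tol (relative) of the learning rate at the last saved step."""
    before = noam_lr(last_step, base_lr, warmup_steps)
    after = noam_lr(resumed_step, base_lr, warmup_steps)
    return abs(after - before) <= rel_tol * before


# Hypothetical values: checkpoint saved at step 50000, training resumed at 50001.
step_restored = lr_is_continuous(50000, 50001, base_lr=2e-3, warmup_steps=10000)

# Same checkpoint, but the step counter was reset to 1 on resume.
step_reset = lr_is_continuous(50000, 1, base_lr=2e-3, warmup_steps=10000)
```

If the step counter is restored along with the optimizer, the first call reports a continuous learning rate; if the counter is reset on resume, the schedule falls back into warmup and the second call reports a discontinuity.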

As you can see, the training itself seems normal, so I would like to ask for help. Has anyone run into this issue before? Any comments would be much appreciated.

nlpyang / PreSumm, issue #154