Does anybody know why the score jump a lot when dropping the learning rate last few epoches during the training?

IDEA-Research / DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Apache License 2.0

2.24k stars 250 forks source link

Does anybody know why the score jump a lot when dropping the learning rate last few epoches during the training? #92

Open AI-Passionner opened 2 years ago

AI-Passionner commented 2 years ago

@SuperHenry2333 I only see this behavior in YOLOX when turning off the augmentation in the last few epochs. Most of the other object detections didn't show this behavior. In other words, why didn't you try the cosine learning rate scheduler?

In my case, when the learning rate dropped at the last epoch, the score dramatically gained 3%, which is very solid.

HaoZhang534 commented 2 years ago

We adopted the same learning rate schedule as previous DETR-like works to suddenly drop to 1/10 of the original learning rate. We did not compare sudden drop with cosine drop. Methods adopting sudden drop usually have a large performance gain when dropping the learning rate.