Closed DacDinh147 closed 3 years ago
I think it due to cosine annealing learning rate schedule has greatly change at the period.
in our yolor-csp experiments, decoupled head +0.5% AP, simOTA +1.0% AP, free anchor almost same AP (+ - 0.2%), multi positive works well for fine-tuning model but usually not work when training from scratch.
@WongKinYiu Thank you a lot for your quick and detail response. After reading the article, I understand the situation that I was in. I close the question. Have a nice day!
@WongKinYiu @DacDinh147 Where are these improvements
Hi author. Thank for your great work. I learn from you a lot. I read your log in YOLOV4 development and I would like to learn from your experiment in training with Implicit + Decouple Head + SimOTA + Free Anchor setting.
I reimplement your work in a new framework and adding Decouple Head + SimOTA + Free anchor on top of Implicit A + M
Currently, I am training on COCO128 toy dataset from scratch, and the training is very unstable around 4k and 6k iteration, MAPs suddenly drop as show in the above figure. Did you experience this issue in your own training and would you please share some trick or insight to avoid this issue? I appreciate it a lot.