Whale-ice opened this issue 1 year ago
Translation of the original post, for future reference: "Hello, thanks for the excellent method. I have a few questions. The experimental-details section of the paper states that the learning rate in the semi-supervised phase is a constant 0.001 and that the teacher model is updated every 25 epochs: 'Then the student is trained for 25 to 75 epochs depending on the amount of unlabeled data with learning rate 0.001, and the teacher is updated every 25 epochs.' May I ask why you don't use a one-cycle learning-rate schedule? And for small datasets, does the teacher model simply never get updated during training (since small datasets are trained for only 25 epochs)? Looking forward to your answer, thank you."
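To make the question concrete, here is a minimal PyTorch sketch of the schedule as I understand it from the quoted text: a student trained with a constant learning rate of 0.001, and a teacher refreshed from the student every 25 epochs. The tiny models, the mean-teacher-style consistency loss, and the epoch counts are stand-in assumptions for illustration, not the authors' actual code.

```python
import copy
import torch
import torch.nn as nn

# Hypothetical tiny models standing in for the paper's teacher/student networks.
student = nn.Linear(4, 2)
teacher = copy.deepcopy(student)

# Constant learning rate of 0.001, as described in the paper
# (i.e., no OneCycleLR or other scheduler).
optimizer = torch.optim.Adam(student.parameters(), lr=0.001)

total_epochs = 75          # 25 to 75 depending on the amount of unlabeled data
teacher_update_every = 25  # teacher refreshed from the student every 25 epochs

for epoch in range(1, total_epochs + 1):
    # Placeholder for one epoch of student training against teacher targets.
    x = torch.randn(8, 4)
    loss = ((student(x) - teacher(x).detach()) ** 2).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    # With a 25-epoch run this fires only once, at the very end of training --
    # which is exactly the small-dataset situation the question asks about.
    if epoch % teacher_update_every == 0:
        teacher.load_state_dict(student.state_dict())
```

Under this reading, a small dataset trained for exactly 25 epochs would see its only teacher update at the final epoch, so the teacher's predictions never influence training after being refreshed.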