Closed LeeDoYup closed 4 years ago
Because the running variables in the momentum encoder are never used in training or fine-tuning in this codebase.
If you need to use them for your project, you can include them in the update.
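For anyone who does want the BN statistics in the update, here is a minimal sketch of one way it could look. It is not the code from this repository: the function name `momentum_update`, its arguments, and the choice to EMA the floating-point buffers (rather than copy them) are assumptions made for illustration.

```python
import torch
import torch.nn as nn


@torch.no_grad()
def momentum_update(encoder_q: nn.Module, encoder_k: nn.Module, m: float = 0.999) -> None:
    """Hypothetical helper: EMA update of encoder_k from encoder_q, buffers included."""
    # Learnable parameters (weights, biases, BN affine terms): standard EMA.
    for p_q, p_k in zip(encoder_q.parameters(), encoder_k.parameters()):
        p_k.mul_(m).add_(p_q.detach(), alpha=1.0 - m)

    # Buffers are not returned by parameters(); for BatchNorm these are
    # running_mean, running_var and num_batches_tracked.
    for b_q, b_k in zip(encoder_q.buffers(), encoder_k.buffers()):
        if torch.is_floating_point(b_k):
            # Assumption: apply the same EMA rule to running_mean / running_var.
            b_k.mul_(m).add_(b_q, alpha=1.0 - m)
        else:
            # num_batches_tracked is an integer counter, so copy it as-is.
            b_k.copy_(b_q)
```

Copying the float buffers directly with `b_k.copy_(b_q)` would be an equally reasonable variant; which one works better for a given project is something to check empirically.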
Thanks for the quick reply! I want to check the following things.
ResNet-50 uses batch normalization, and the default setting tracks the running variables (running mean and variance), so the momentum encoder uses them. However, I think they are also computed whenever the momentum encoder runs forward(im_k).
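The behaviour in question can be checked on a single BN layer. This is a small sketch, assuming the momentum encoder is left in train mode while it processes the keys; the layer size and input are made up for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
bn = nn.BatchNorm1d(3)          # track_running_stats=True by default
x = torch.randn(8, 3) + 5.0     # toy batch with per-feature mean around 5

# Train mode: the forward pass updates the running statistics even under
# no_grad, but the output is normalized with the *batch* statistics.
bn.train()
before = bn.running_mean.clone()
with torch.no_grad():
    out_train = bn(x)
running_mean_changed = not torch.equal(before, bn.running_mean)

# Eval mode: only here are the running statistics used for normalization.
bn.eval()
with torch.no_grad():
    out_eval = bn(x)

print("running_mean updated in train-mode forward:", running_mean_changed)
print("train-mode output mean per feature:", out_train.mean(dim=0))
print("eval-mode output mean per feature:", out_eval.mean(dim=0))
```

So the running statistics are updated by the forward pass, yet they only affect the output once the module is switched to eval mode.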
Hello. Thanks for the awesome project! I have a question.
I wonder why the EMA update doesn't track the running mean and variance of BN.
I think the code below is right, because the EMA model performs poorly when the running variables in BN are not tracked.