Open mmatena opened 3 years ago
It looks like I was fine-tuning only the head weights, and not the body, for the first pass. I think that's because I had set trainable=False, which I did because I was otherwise getting gradient-not-found errors.
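A quick way to check which weights are actually being trained is to count the trainable parameters with the body frozen and unfrozen. This is only a minimal sketch assuming a tf.keras setup; the tiny `body` and `head` layers below are hypothetical stand-ins, not the actual SimCLR model or the code used in these experiments.

```python
import tensorflow as tf

# Hypothetical stand-ins: a small "body" (the pretrained encoder in practice)
# and a "head" (the new classification layer).
body = tf.keras.Sequential(
    [tf.keras.layers.Dense(8, activation="relu")], name="body"
)
head = tf.keras.layers.Dense(3, name="head")

inputs = tf.keras.Input(shape=(4,))
outputs = head(body(inputs))
model = tf.keras.Model(inputs, outputs)


def count_trainable(m):
    """Total number of scalar parameters the optimizer would update."""
    return sum(w.numpy().size for w in m.trainable_weights)


# Frozen body: only the head's kernel and bias remain trainable.
body.trainable = False
frozen_count = count_trainable(model)

# Unfrozen body: body and head weights are both trainable.
body.trainable = True
full_count = count_trainable(model)

print("trainable params, frozen body:", frozen_count)
print("trainable params, full fine-tuning:", full_count)
```

If the frozen count equals the size of the head alone, then only the head is being fine-tuned. Note that a change to `trainable` generally needs to happen before `model.compile(...)` (or be followed by a recompile) for it to take effect in `model.fit`.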
I might hold off a bit until I get a working SimCLR implementation.
Also look at the learning rate and regularization strengths that I chose.
By "second pass" I mean the runs after I switched away from frozen-body fine-tuning.
Note: I should also try some experiments with supervised ImageNet pretrained models. A lot of the stuff here will be the same as for them, but I'll just focus on SimCLR stuff here.