google-research / big_transfer

Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
https://arxiv.org/abs/1912.11370
Apache License 2.0

PyTorch BiT-M-ResNet152x4 model bug #34

Open junsukchoe opened 4 years ago

junsukchoe commented 4 years ago

Hello,

I am seeing unexpected behavior from BiT-M-ResNet152x4 in PyTorch. When I evaluate BiT-M-R152x4-ILSVRC2012.npz on the ImageNet-1k dataset, most of the model outputs are NaN, so the resulting accuracy is very low. Since the other models work well, I suspect there may be a bug in the pretrained weights of BiT-M-R152x4.
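
One way to narrow this down is to check whether the checkpoint itself contains non-finite values, separately from the forward pass. The snippet below is only a diagnostic sketch, not code from this repository: the helper names `find_nonfinite_arrays` and `nan_fraction` are made up for illustration, and it assumes the `.npz` file is a plain NumPy archive of weight arrays.

```python
import numpy as np


def find_nonfinite_arrays(npz_path):
    """Map array name -> number of NaN/Inf entries, for arrays that have any.

    Hypothetical helper: scans every floating-point array in a .npz archive.
    """
    bad = {}
    with np.load(npz_path) as weights:
        for name in weights.files:
            arr = weights[name]
            # Integer arrays (e.g. counters) cannot hold NaN/Inf, so skip them.
            if not np.issubdtype(arr.dtype, np.floating):
                continue
            n_bad = int(np.count_nonzero(~np.isfinite(arr)))
            if n_bad:
                bad[name] = n_bad
    return bad


def nan_fraction(outputs):
    """Fraction of entries in a batch of model outputs that are NaN.

    Hypothetical helper: `outputs` is anything convertible to a float array,
    e.g. logits moved to CPU and converted to NumPy.
    """
    outputs = np.asarray(outputs, dtype=np.float64)
    return float(np.isnan(outputs).mean())
```

Calling `find_nonfinite_arrays("BiT-M-R152x4-ILSVRC2012.npz")` on the downloaded file would return an empty dict if every weight is finite. In that case the NaNs would have to arise during the forward pass (for example from overflow in activations) rather than from the stored weights, and `nan_fraction` on the logits of a single batch gives a quick number to compare across models.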