MCG-NJU / TDN

[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
https://arxiv.org/abs/2012.10071
Apache License 2.0
368 stars 55 forks source link

train new dataset with pretrained model #69

Open sanwei111 opened 1 year ago

sanwei111 commented 1 year ago

hello doctor.I TRAIN tdn with new data with pretrain model,but it easily go converge,in the first epoch,does something wrong?besides,when i train,it notice below,it is ok? [12/08 15:20:34] TDN INFO: #### Notice: keys that failed to load: {'module.base_model.layer3_bak.3.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer3_bak.3.mse.bn3_smallscale2.running_var', 'module.base_model.layer2_bak.1.mse.conv3.weight', 'base_model.layer2_bak.2.shift.conv.weight', 'base_model.layer3_bak.1.mse.bn3_smallscale4.running_var', 'module.base_model.layer3_bak.0.bn3.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.bn3.running_var', 'base_model.layer3_bak.5.mse.bn1.weight', 'base_model.resnext_layer1.0.bn1.running_var', 'base_model.layer3_bak.0.bn2.num_batches_tracked', 'module.base_model.layer2_bak.2.conv1.bias', 'module.base_model.resnext_layer1.2.conv2.weight', 'module.base_model.layer3_bak.0.mse.bn3_smallscale2.running_mean', 'base_model.layer2_bak.2.mse.bn3_smallscale2.bias', 'base_model.layer2_bak.3.mse.bn3_smallscale4.bias', 'base_model.layer2_bak.2.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.resnext_layer1.0.bn3.weight', 'module.base_model.layer2_bak.0.bn1.num_batches_tracked', 'module.base_model.layer1_bak.2.bn1.bias', 'base_model.layer3_bak.2.bn2.num_batches_tracked', 'base_model.layer3_bak.3.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer3_bak.5.mse.bn3_smallscale4.running_mean', 'base_model.layer2_bak.2.conv1.bias', 'module.base_model.layer3_bak.0.mse.bn1.bias', 'base_model.layer4_bak.0.mse.bn1.running_mean', 'base_model.layer1_bak.0.downsample.1.bias', 'base_model.layer2_bak.0.bn1.weight', 'base_model.layer2_bak.1.bn2.running_var', 'base_model.layer3_bak.3.bn1.weight', 'base_model.resnext_layer1.2.conv1.weight', 'base_model.resnext_layer1.0.downsample.1.num_batches_tracked', 'base_model.layer3_bak.5.conv1.weight', 'base_model.layer4_bak.0.downsample.1.running_mean', 'module.base_model.layer3_bak.4.bn3.running_mean', 'module.base_model.layer2_bak.1.mse.bn3_smallscale4.running_mean', 'module.base_model.layer3_bak.5.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.2.mse.bn1.bias', 'base_model.layer2_bak.2.mse.bn3_smallscale2.weight', 'module.base_model.resnext_layer1.2.bn2.running_var', 'base_model.conv1.bias', 'base_model.layer1_bak.0.bn3.weight', 'module.base_model.layer2_bak.1.mse.bn1.bias', 'base_model.layer3_bak.0.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer3_bak.0.mse.bn3_smallscale2.bias', 'base_model.layer3_bak.2.mse.bn3.weight', 'base_model.layer2_bak.1.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer3_bak.4.bn2.bias', 'base_model.layer4_bak.2.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer4_bak.2.mse.bn3.running_mean', 'module.base_model.layer4_bak.2.mse.bn1.bias', 'module.base_model.layer1_bak.2.bn1.running_var', 'base_model.layer1_bak.2.conv1.bias', 'base_model.resnext_layer1.0.bn3.running_mean', 'base_model.layer3_bak.0.conv3.weight', 'base_model.layer4_bak.1.conv3.weight', 'module.base_model.layer3_bak.5.conv2.weight', 'base_model.layer4_bak.1.bn2.bias', 'base_model.layer1_bak.1.bn3.bias', 'base_model.layer3_bak.0.bn3.num_batches_tracked', 'module.base_model.layer2_bak.3.bn1.running_var', 'module.base_model.layer2_bak.1.conv3.weight', 'module.base_model.layer4_bak.2.mse.bn3_smallscale2.bias', 'base_model.layer4_bak.1.mse.conv3_smallscale4.weight', 'module.base_model.layer2_bak.1.bn3.running_mean', 'base_model.layer2_bak.1.conv1.bias', 'base_model.resnext_layer1.2.conv2.bias', 'base_model.layer4_bak.1.mse.bn3.running_mean', 'base_model.layer4_bak.2.bn3.num_batches_tracked', 'module.base_model.layer2_bak.1.bn1.weight', 'base_model.layer2_bak.0.downsample.1.weight', 'module.base_model.resnext_layer1.2.bn1.running_var', 'module.base_model.layer3_bak.2.bn1.num_batches_tracked', 'module.base_model.layer4_bak.0.bn3.num_batches_tracked', 'base_model.layer3_bak.4.conv2.weight', 'base_model.layer3_bak.0.mse.bn3.weight', 'module.base_model.layer1_bak.0.bn3.running_var', 'base_model.layer4_bak.1.mse.bn3_smallscale4.weight', 'base_model.layer3_bak.1.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer2_bak.2.mse.bn3.running_mean', 'base_model.resnext_layer1.0.conv1.weight', 'base_model.layer3_bak.5.conv1.bias', 'base_model.layer2_bak.1.conv2.weight', 'module.base_model.layer1_bak.0.downsample.1.running_var', 'base_model.layer2_bak.1.bn3.weight', 'base_model.layer2_bak.3.bn1.weight', 'module.base_model.layer2_bak.1.bn1.bias', 'module.base_model.layer3_bak.3.mse.bn1.running_mean', 'base_model.layer1_bak.2.bn3.running_var', 'module.base_model.layer2_bak.2.bn1.running_mean', 'base_model.resnext_layer1.1.conv1.bias', 'base_model.layer3_bak.3.bn1.num_batches_tracked', 'module.base_model.resnext_layer1.1.conv3.bias', 'module.base_model.resnext_layer1.0.downsample.1.running_var', 'base_model.layer3_bak.3.bn2.bias', 'module.base_model.layer2_bak.1.bn2.num_batches_tracked', 'module.base_model.layer1_bak.2.bn3.num_batches_tracked', 'base_model.layer3_bak.0.mse.bn3_smallscale4.bias', 'module.base_model.layer1_bak.0.conv2.bias', 'base_model.layer3_bak.1.bn1.running_var', 'base_model.resnext_layer1.1.bn2.num_batches_tracked', 'base_model.resnext_layer1.1.bn3.weight', 'base_model.layer2_bak.3.bn2.weight', 'module.base_model.layer2_bak.2.mse.bn1.running_var', 'module.base_model.layer4_bak.2.bn3.weight', 'base_model.layer2_bak.2.bn2.weight', 'base_model.layer1_bak.0.conv1.bias', 'module.base_model.layer3_bak.0.bn2.weight', 'base_model.layer3_bak.0.downsample.0.bias', 'base_model.layer2_bak.1.mse.conv1.weight', 'base_model.layer3_bak.3.mse.conv3_smallscale2.weight', 'module.base_model.layer4_bak.0.mse.bn1.num_batches_tracked', 'base_model.layer3_bak.3.bn2.weight', 'base_model.layer2_bak.2.mse.bn1.running_var', 'base_model.layer2_bak.2.bn3.weight', 'module.base_model.layer2_bak.2.bn1.weight', 'module.base_model.layer4_bak.1.mse.bn3.running_mean', 'base_model.resnext_layer1.1.bn3.num_batches_tracked', 'base_model.layer2_bak.2.bn1.bias', 'module.base_model.layer3_bak.3.bn1.weight', 'module.base_model.layer2_bak.3.mse.bn3_smallscale2.weight', 'base_model.resnext_layer1.1.bn3.bias', 'base_model.layer3_bak.2.mse.conv1.weight', 'base_model.resnext_layer1.0.bn3.running_var', 'module.base_model.layer3_bak.1.conv3.bias', 'base_model.layer3_bak.1.mse.bn3.weight', 'module.base_model.layer4_bak.1.conv1.weight', 'module.base_model.layer3_bak.2.conv2.weight', 'base_model.layer4_bak.0.mse.conv1.weight', 'base_model.layer4_bak.1.mse.conv3_smallscale2.weight', 'module.base_model.layer3_bak.1.shift.conv.weight', 'module.base_model.resnext_layer1.0.conv3.bias', 'module.base_model.layer2_bak.0.bn1.running_var', 'base_model.layer3_bak.3.mse.bn3_smallscale4.weight', 'base_model.layer2_bak.1.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.conv2.weight', 'base_model.resnext_layer1.1.conv3.bias', 'module.base_model.layer4_bak.2.bn1.bias', 'base_model.layer1_bak.1.bn1.weight', 'module.base_model.layer2_bak.1.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.5.mse.conv3_smallscale2.weight', 'module.base_model.layer2_bak.2.shift.conv.weight', 'base_model.resnext_layer1.0.downsample.1.running_mean', 'module.base_model.layer1_bak.1.bn1.running_var', 'module.base_model.layer3_bak.4.mse.bn1.running_var', 'base_model.layer4_bak.1.conv1.weight', 'base_model.layer4_bak.0.mse.bn3_smallscale2.bias', 'module.base_model.layer4_bak.0.conv1.weight', 'module.base_model.layer3_bak.3.mse.conv3.weight', 'base_model.layer4_bak.2.mse.bn1.running_var', 'base_model.layer4_bak.0.bn3.weight', 'base_model.layer3_bak.1.mse.bn1.bias', 'base_model.layer2_bak.0.mse.bn3_smallscale2.running_mean', 'module.base_model.layer3_bak.4.mse.conv3_smallscale4.weight', 'base_model.layer3_bak.3.mse.bn1.num_batches_tracked', 'module.base_model.layer4_bak.2.conv1.weight', 'module.base_model.layer3_bak.0.bn1.num_batches_tracked', 'module.base_model.layer3_bak.5.mse.bn3.running_mean', 'module.base_model.layer1_bak.1.bn1.running_mean', 'module.base_model.resnext_layer1.0.bn3.bias', 'base_model.resnext_layer1.0.conv3.bias', 'base_model.layer1_bak.1.conv2.bias', 'base_model.layer1_bak.0.conv1.weight', 'module.base_model.layer3_bak.5.conv3.bias', 'module.base_model.resnext_layer1.0.bn2.bias', 'module.base_model.layer1_bak.2.bn2.bias', 'module.base_model.layer2_bak.2.mse.bn3_smallscale2.running_mean', 'module.base_model.layer3_bak.5.conv2.bias', 'base_model.layer4_bak.0.conv2.weight', 'module.base_model.layer2_bak.2.bn2.running_mean', 'base_model.layer2_bak.0.conv3.weight', 'base_model.layer4_bak.0.bn1.running_var', 'module.base_model.layer3_bak.3.mse.conv1.weight', 'base_model.conv1_5.1.bias', 'base_model.conv1_5.1.running_mean', 'module.base_model.layer1_bak.0.bn2.num_batches_tracked', 'module.base_model.layer2_bak.1.mse.bn3_smallscale2.running_var', 'module.base_model.layer2_bak.2.bn3.num_batches_tracked', 'base_model.conv1_5.1.num_batches_tracked', 'module.base_model.layer3_bak.0.shift.conv.weight', 'base_model.layer1_bak.0.downsample.1.weight', 'base_model.layer3_bak.0.downsample.0.weight', 'base_model.layer3_bak.2.bn2.running_var', 'base_model.layer3_bak.2.bn1.num_batches_tracked', 'base_model.layer3_bak.1.mse.bn1.num_batches_tracked', 'module.base_model.layer3_bak.2.conv3.bias', 'base_model.layer2_bak.0.mse.bn1.num_batches_tracked', 'base_model.layer3_bak.3.conv3.weight', 'module.base_model.layer2_bak.2.bn2.weight', 'module.base_model.layer3_bak.3.bn2.running_var', 'base_model.layer3_bak.0.conv1.weight', 'base_model.layer3_bak.0.mse.bn3_smallscale4.running_var', 'module.base_model.layer4_bak.2.conv2.weight', 'base_model.layer2_bak.1.mse.bn1.running_var', 'base_model.layer3_bak.0.mse.conv3_smallscale4.weight', 'base_model.layer4_bak.2.bn2.running_var', 'module.base_model.layer4_bak.0.bn2.running_mean', 'base_model.layer1_bak.2.bn1.running_mean', 'module.base_model.layer3_bak.2.bn1.bias', 'module.base_model.layer4_bak.1.mse.bn3.running_var', 'module.base_model.layer3_bak.2.mse.bn1.weight', 'module.base_model.layer3_bak.3.conv1.bias', 'module.base_model.resnext_layer1.1.bn2.running_mean', 'module.base_model.layer3_bak.0.downsample.0.weight', 'module.base_model.resnext_layer1.2.bn1.running_mean', 'base_model.resnext_layer1.0.bn3.num_batches_tracked', 'module.base_model.layer4_bak.2.mse.conv3.weight', 'base_model.layer1_bak.2.conv3.bias', 'module.base_model.layer2_bak.3.mse.bn3_smallscale4.running_mean', 'base_model.layer2_bak.2.mse.bn3_smallscale4.running_mean', 'base_model.layer4_bak.1.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer3_bak.5.shift.conv.weight', 'module.base_model.layer4_bak.1.mse.bn1.weight', 'base_model.layer3_bak.4.mse.bn3.weight', 'base_model.layer4_bak.2.conv1.bias', 'base_model.layer2_bak.3.bn3.running_var', 'module.base_model.layer2_bak.0.mse.bn1.running_var', 'module.base_model.layer4_bak.2.bn1.weight', 'base_model.layer3_bak.2.bn2.bias', 'module.base_model.layer2_bak.3.bn1.num_batches_tracked', 'module.base_model.layer2_bak.2.bn1.running_var', 'module.base_model.layer2_bak.1.mse.conv3_smallscale2.weight', 'module.base_model.layer2_bak.3.mse.bn1.num_batches_tracked', 'base_model.layer1_bak.0.conv3.bias', 'module.base_model.layer3_bak.0.conv2.weight', 'base_model.layer1_bak.1.bn3.running_var', 'module.base_model.layer3_bak.3.bn3.weight', 'module.base_model.layer3_bak.0.conv3.bias', 'base_model.layer3_bak.0.mse.bn1.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.bn3.weight', 'module.base_model.layer3_bak.1.bn1.num_batches_tracked', 'module.base_model.layer1_bak.2.conv2.bias', 'base_model.layer3_bak.0.mse.bn3_smallscale4.running_mean', 'module.base_model.resnext_layer1.2.bn3.num_batches_tracked', 'module.base_model.layer4_bak.2.mse.conv2.weight', 'module.base_model.layer1_bak.1.conv2.bias', 'base_model.layer4_bak.1.bn3.weight', 'module.base_model.layer3_bak.0.bn2.bias', 'base_model.layer2_bak.3.mse.bn1.running_mean', 'module.base_model.layer3_bak.5.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.0.downsample.1.running_var', 'base_model.layer2_bak.0.mse.bn3_smallscale2.weight', 'module.base_model.layer1_bak.0.bn2.weight', 'base_model.layer3_bak.0.bn1.running_var', 'base_model.layer2_bak.1.conv1.weight', 'module.base_model.resnext_layer1.1.bn2.running_var', 'module.base_model.layer4_bak.1.conv1.bias', 'base_model.layer4_bak.1.mse.bn3.weight', 'module.base_model.resnext_layer1.0.conv2.bias', 'base_model.layer3_bak.5.bn3.bias', 'base_model.layer3_bak.0.mse.bn1.running_mean', 'module.base_model.layer3_bak.1.mse.conv3.weight', 'module.base_model.layer3_bak.5.bn2.num_batches_tracked', 'module.base_model.layer2_bak.1.bn2.weight', 'base_model.layer2_bak.2.mse.bn1.running_mean', 'module.base_model.layer3_bak.4.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer3_bak.4.bn3.running_var', 'base_model.layer4_bak.1.mse.conv2.weight', 'base_model.resnext_layer1.2.bn3.weight', 'base_model.layer4_bak.2.shift.conv.weight', 'module.base_model.layer1_bak.1.conv2.weight', 'module.base_model.layer4_bak.1.mse.conv3.weight', 'base_model.layer3_bak.4.bn2.running_var', 'module.base_model.layer3_bak.4.bn1.num_batches_tracked', 'base_model.resnext_layer1.0.downsample.0.weight', 'base_model.layer3_bak.2.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.resnext_layer1.2.bn3.running_var', 'base_model.layer3_bak.4.mse.bn3_smallscale2.weight', 'base_model.layer2_bak.1.mse.bn1.bias', 'base_model.layer2_bak.0.mse.bn3_smallscale4.running_mean', 'module.base_model.layer3_bak.5.bn1.weight', 'base_model.layer3_bak.3.mse.bn1.bias', 'base_model.layer4_bak.1.mse.bn3_smallscale4.running_mean', 'module.base_model.layer2_bak.0.bn1.running_mean', 'base_model.layer3_bak.5.bn2.weight', 'base_model.layer3_bak.0.downsample.1.running_mean', 'module.base_model.layer3_bak.3.mse.conv3_smallscale2.weight', 'module.base_model.resnext_layer1.1.bn3.weight', 'base_model.layer2_bak.2.mse.bn3.weight', 'base_model.layer4_bak.0.mse.bn1.weight', 'module.base_model.layer3_bak.3.conv2.bias', 'base_model.resnext_layer1.2.bn2.num_batches_tracked', 'base_model.resnext_layer1.2.bn1.running_mean', 'module.base_model.resnext_layer1.1.bn1.running_mean', 'base_model.layer3_bak.5.mse.bn3_smallscale2.running_mean', 'module.base_model.layer2_bak.0.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.1.conv2.weight', 'base_model.layer3_bak.0.mse.conv3.weight', 'module.base_model.layer4_bak.1.conv3.weight', 'module.base_model.layer4_bak.0.mse.bn3_smallscale4.running_mean', 'module.base_model.layer4_bak.1.bn3.num_batches_tracked', 'base_model.resnext_layer1.0.bn2.num_batches_tracked', 'base_model.layer3_bak.1.bn1.bias', 'base_model.layer2_bak.0.mse.bn1.running_mean', 'module.base_model.layer3_bak.0.bn2.num_batches_tracked', 'base_model.layer4_bak.2.conv2.bias', 'base_model.layer2_bak.2.bn1.weight', 'module.base_model.layer3_bak.3.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.5.conv3.weight', 'module.base_model.layer3_bak.4.shift.conv.weight', 'base_model.layer4_bak.1.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.3.mse.bn1.weight', 'base_model.layer3_bak.4.mse.conv2.weight', 'base_model.layer3_bak.4.bn2.running_mean', 'module.base_model.layer4_bak.1.mse.bn1.bias', 'module.base_model.layer3_bak.4.mse.conv2.weight', 'base_model.layer3_bak.1.conv2.weight', 'base_model.layer4_bak.2.mse.conv3.weight', 'module.base_model.layer3_bak.1.mse.bn1.weight', 'module.base_model.layer4_bak.1.mse.conv3_smallscale4.weight', 'base_model.layer1_bak.0.bn3.running_mean', 'base_model.layer3_bak.3.conv2.weight', 'module.base_model.layer1_bak.1.conv1.weight', 'module.base_model.layer2_bak.0.mse.bn3_smallscale4.bias', 'base_model.layer4_bak.2.conv2.weight', 'base_model.layer4_bak.1.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer2_bak.3.mse.bn3.bias', 'base_model.layer3_bak.5.mse.bn3_smallscale4.weight', 'module.base_model.layer2_bak.0.bn3.weight', 'module.base_model.layer2_bak.1.mse.bn3.running_var', 'base_model.layer4_bak.0.mse.bn1.bias', 'module.base_model.layer3_bak.0.bn3.bias', 'base_model.layer2_bak.0.bn3.running_mean', 'base_model.layer2_bak.3.bn3.running_mean', 'module.base_model.layer4_bak.0.downsample.0.weight', 'base_model.layer3_bak.3.conv1.bias', 'module.base_model.layer4_bak.2.conv3.weight', 'module.base_model.layer4_bak.2.bn3.running_mean', 'base_model.layer3_bak.1.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer4_bak.1.mse.conv1.weight', 'module.base_model.layer4_bak.2.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.3.mse.bn3_smallscale4.weight', 'base_model.layer4_bak.2.mse.conv2.weight', 'base_model.layer3_bak.5.mse.conv3_smallscale2.weight', 'module.base_model.layer2_bak.3.mse.bn3.weight', 'base_model.layer2_bak.3.conv2.bias', 'base_model.layer3_bak.2.mse.bn1.running_var', 'module.base_model.layer2_bak.3.mse.bn3_smallscale4.weight', 'module.base_model.layer2_bak.3.bn3.weight', 'module.base_model.layer3_bak.4.mse.conv3_smallscale2.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale2.running_mean', 'module.base_model.layer2_bak.3.conv3.weight', 'module.base_model.layer2_bak.1.mse.conv2.weight', 'base_model.layer2_bak.0.conv2.bias', 'module.base_model.layer4_bak.2.bn3.bias', 'module.base_model.layer3_bak.0.mse.bn3_smallscale2.weight', 'base_model.layer3_bak.0.mse.bn3.running_var', 'base_model.layer3_bak.4.shift.conv.weight', 'base_model.layer3_bak.3.mse.bn1.weight', 'module.base_model.layer1_bak.2.bn3.bias', 'base_model.resnext_layer1.2.bn2.bias', 'base_model.layer3_bak.0.mse.bn3_smallscale2.running_mean', 'module.base_model.layer3_bak.2.conv3.weight', 'module.base_model.layer3_bak.1.mse.bn1.running_mean', 'module.base_model.layer1_bak.0.conv3.weight', 'base_model.layer3_bak.5.bn2.num_batches_tracked', 'module.base_model.layer1_bak.0.bn2.bias', 'base_model.layer3_bak.1.mse.bn3.running_mean', 'base_model.layer4_bak.1.bn1.running_var', 'module.base_model.layer3_bak.3.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer2_bak.3.conv3.weight', 'module.base_model.layer3_bak.3.mse.bn3_smallscale4.bias', 'base_model.resnext_layer1.1.bn2.running_var', 'base_model.layer2_bak.3.bn2.running_mean', 'module.base_model.layer4_bak.2.mse.bn3.running_var', 'module.base_model.layer2_bak.2.conv3.weight', 'module.base_model.layer2_bak.0.mse.conv2.weight', 'module.base_model.layer2_bak.1.mse.bn1.num_batches_tracked', 'module.base_model.layer4_bak.2.bn2.running_mean', 'base_model.layer2_bak.3.mse.conv3_smallscale2.weight', 'base_model.layer3_bak.0.bn1.running_mean', 'module.base_model.layer1_bak.0.downsample.1.weight', 'module.base_model.conv1.bias', 'module.base_model.layer2_bak.1.mse.bn1.weight', 'base_model.layer3_bak.1.bn3.weight', 'base_model.resnext_layer1.1.conv1.weight', 'base_model.layer1_bak.2.conv2.weight', 'module.base_model.layer3_bak.2.mse.bn3.num_batches_tracked', 'module.base_model.layer1_bak.0.bn3.num_batches_tracked', 'base_model.layer4_bak.0.downsample.1.running_var', 'base_model.layer4_bak.2.mse.bn3.num_batches_tracked', 'base_model.layer2_bak.3.mse.bn3_smallscale2.weight', 'base_model.layer3_bak.5.mse.conv2.weight', 'module.base_model.layer3_bak.2.bn2.weight', 'module.base_model.layer3_bak.3.bn1.num_batches_tracked', 'module.base_model.layer3_bak.1.bn3.running_var', 'module.base_model.layer4_bak.1.bn2.weight', 'module.base_model.layer2_bak.1.bn3.num_batches_tracked', 'module.base_model.layer3_bak.2.shift.conv.weight', 'module.base_model.conv1_5.1.num_batches_tracked', 'module.base_model.layer4_bak.0.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.1.mse.bn3_smallscale2.bias', 'base_model.layer4_bak.1.mse.bn3_smallscale2.weight', 'base_model.layer2_bak.3.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.3.mse.bn3.num_batches_tracked', 'base_model.layer3_bak.2.conv3.bias', 'module.base_model.layer3_bak.0.mse.conv1.weight', 'base_model.layer3_bak.5.mse.bn3_smallscale2.weight', 'module.base_model.layer2_bak.2.bn2.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.bn3_smallscale4.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer2_bak.0.downsample.1.num_batches_tracked', 'module.base_model.layer4_bak.1.bn1.running_mean', 'module.base_model.layer3_bak.0.downsample.1.running_mean', 'module.base_model.layer2_bak.3.mse.bn3.running_var', 'module.base_model.layer4_bak.0.mse.bn3.bias', 'base_model.layer4_bak.0.conv1.bias', 'base_model.layer4_bak.2.mse.bn3_smallscale2.running_var', 'base_model.layer3_bak.4.mse.conv3.weight', 'base_model.layer2_bak.1.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.2.bn3.running_var', 'base_model.layer3_bak.4.bn3.num_batches_tracked', 'base_model.layer1_bak.2.bn3.num_batches_tracked', 'module.base_model.layer3_bak.0.mse.conv3_smallscale2.weight', 'module.base_model.layer1_bak.1.bn1.bias', 'module.base_model.layer2_bak.2.mse.bn3_smallscale4.running_var', 'module.base_model.layer3_bak.5.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.resnext_layer1.0.bn1.weight', 'base_model.resnext_layer1.0.bn1.bias', 'base_model.layer2_bak.1.bn1.running_mean', 'base_model.layer4_bak.2.mse.bn3.running_var', 'module.base_model.layer2_bak.1.mse.bn3_smallscale2.running_mean', 'module.base_model.layer3_bak.2.conv2.bias', 'base_model.layer3_bak.5.bn3.running_var', 'base_model.layer4_bak.1.mse.conv3.weight', 'base_model.layer1_bak.2.bn1.bias', 'module.base_model.layer4_bak.1.mse.conv3_smallscale2.weight', 'base_model.layer2_bak.3.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer3_bak.3.mse.conv1.weight', 'base_model.layer4_bak.0.bn2.bias', 'module.base_model.layer3_bak.0.conv3.weight', 'base_model.layer3_bak.5.mse.bn3_smallscale2.bias', 'module.base_model.layer2_bak.1.conv1.bias', 'module.base_model.layer3_bak.1.mse.bn3_smallscale2.running_var', 'base_model.layer3_bak.5.conv3.bias', 'module.base_model.layer2_bak.1.mse.bn3.running_mean', 'module.base_model.layer4_bak.1.bn1.weight', 'base_model.layer2_bak.1.bn1.bias', 'base_model.layer1_bak.2.bn2.num_batches_tracked', 'module.base_model.layer2_bak.2.conv1.weight', 'module.base_model.layer3_bak.5.mse.bn1.running_var', 'base_model.layer4_bak.0.bn2.running_mean', 'base_model.layer3_bak.2.mse.bn3_smallscale4.weight', 'module.base_model.layer4_bak.2.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.0.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.4.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer3_bak.1.bn1.weight', 'base_model.layer2_bak.1.shift.conv.weight', 'module.base_model.resnext_layer1.2.conv1.weight', 'module.base_model.layer1_bak.2.conv1.weight', 'base_model.layer2_bak.2.mse.bn3.running_var', 'module.base_model.layer1_bak.2.bn3.running_mean', 'base_model.layer2_bak.0.downsample.0.weight', 'base_model.layer3_bak.5.mse.bn3.running_var', 'base_model.layer3_bak.1.bn1.weight', 'module.base_model.layer3_bak.5.bn1.running_mean', 'base_model.layer2_bak.2.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer4_bak.0.bn2.num_batches_tracked', 'module.base_model.layer4_bak.2.mse.bn3_smallscale2.running_var', 'base_model.layer2_bak.0.conv1.bias', 'module.base_model.layer2_bak.2.mse.bn3_smallscale2.bias', 'module.base_model.layer3_bak.1.bn1.running_mean', 'module.base_model.layer3_bak.5.conv1.weight', 'base_model.layer2_bak.1.mse.bn1.running_mean', 'module.base_model.layer3_bak.0.bn3.running_var', 'module.base_model.layer3_bak.1.conv1.bias', 'base_model.layer4_bak.2.conv3.weight', 'base_model.layer3_bak.1.conv2.bias', 'base_model.layer4_bak.0.mse.bn3.running_mean', 'base_model.layer3_bak.2.mse.bn3.running_mean', 'module.base_model.layer4_bak.2.mse.bn3.weight', 'base_model.layer2_bak.1.bn1.weight', 'base_model.layer1_bak.0.downsample.0.weight', 'base_model.layer2_bak.2.bn3.running_mean', 'base_model.layer2_bak.2.mse.bn1.bias', 'module.base_model.layer2_bak.1.bn3.bias', 'module.base_model.layer4_bak.1.bn3.running_var', 'base_model.layer2_bak.2.conv2.bias', 'module.base_model.conv1_5.1.running_var', 'module.base_model.resnext_layer1.0.conv2.weight', 'module.base_model.layer4_bak.0.downsample.1.running_mean', 'module.base_model.layer4_bak.2.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer4_bak.2.mse.bn3.weight', 'base_model.resnext_layer1.1.bn1.running_var', 'module.base_model.resnext_layer1.0.downsample.1.num_batches_tracked', 'base_model.layer1_bak.0.bn2.num_batches_tracked', 'base_model.layer1_bak.1.bn2.bias', 'module.base_model.layer1_bak.0.conv1.weight', 'base_model.layer3_bak.3.bn2.running_mean', 'base_model.layer1_bak.0.downsample.1.running_mean', 'module.base_model.layer2_bak.0.downsample.0.bias', 'base_model.layer3_bak.2.conv2.weight', 'base_model.layer2_bak.3.mse.bn3_smallscale2.running_var', 'base_model.layer3_bak.5.mse.bn3_smallscale4.bias', 'module.base_model.layer2_bak.1.conv3.bias', 'module.base_model.layer3_bak.4.bn3.bias', 'module.base_model.layer2_bak.0.mse.bn1.weight', 'module.base_model.layer4_bak.0.bn1.bias', 'module.base_model.layer3_bak.3.bn3.running_var', 'module.base_model.layer3_bak.1.mse.conv3_smallscale2.weight', 'base_model.layer4_bak.2.mse.bn3_smallscale2.running_mean', 'base_model.layer1_bak.1.bn2.weight', 'base_model.layer1_bak.0.bn2.weight', 'base_model.layer3_bak.4.mse.conv3_smallscale4.weight', 'module.base_model.layer3_bak.3.mse.conv2.weight', 'module.base_model.layer3_bak.4.bn1.bias', 'module.base_model.layer2_bak.0.downsample.1.bias', 'module.base_model.layer3_bak.4.mse.bn3_smallscale4.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.1.conv1.weight', 'module.base_model.layer3_bak.1.mse.bn3_smallscale2.running_mean', 'base_model.resnext_layer1.0.bn3.bias', 'module.base_model.layer3_bak.1.conv2.bias', 'base_model.layer2_bak.3.bn3.weight', 'base_model.layer3_bak.1.mse.conv2.weight', 'module.base_model.layer3_bak.3.conv3.weight', 'base_model.layer3_bak.0.mse.bn1.running_var', 'base_model.layer4_bak.0.conv1.weight', 'module.base_model.layer4_bak.2.bn1.running_var', 'base_model.layer1_bak.0.bn2.running_var', 'module.base_model.layer2_bak.3.mse.bn1.weight', 'module.base_model.layer4_bak.0.shift.conv.weight', 'module.base_model.layer2_bak.2.bn3.weight', 'base_model.layer3_bak.5.bn3.running_mean', 'base_model.layer2_bak.3.mse.bn1.num_batches_tracked', 'module.base_model.layer4_bak.0.mse.bn3.running_mean', 'base_model.layer3_bak.1.mse.bn1.weight', 'base_model.layer3_bak.2.bn3.weight', 'base_model.layer1_bak.2.bn2.running_var', 'base_model.conv1_temp.weight', 'module.base_model.layer3_bak.4.mse.conv3.weight', 'module.base_model.layer2_bak.3.bn1.bias', 'module.base_model.layer1_bak.0.bn1.running_mean', 'module.base_model.layer4_bak.0.downsample.0.bias', 'base_model.layer4_bak.0.bn1.weight', 'module.base_model.layer2_bak.1.mse.bn3_smallscale4.bias', 'base_model.layer1_bak.1.bn2.running_mean', 'module.base_model.layer3_bak.2.bn3.num_batches_tracked', 'base_model.layer3_bak.4.bn3.bias', 'module.base_model.conv1_temp.weight', 'module.base_model.layer4_bak.0.mse.bn3.running_var', 'module.base_model.layer2_bak.3.mse.conv1.weight', 'base_model.layer2_bak.1.bn2.running_mean', 'base_model.layer2_bak.0.mse.conv3_smallscale2.weight', 'base_model.layer3_bak.4.bn3.running_mean', 'module.base_model.layer2_bak.1.mse.conv1.weight', 'module.base_model.layer3_bak.3.bn3.running_mean', 'base_model.layer1_bak.2.bn1.running_var', 'module.base_model.layer3_bak.1.mse.bn1.num_batches_tracked', 'base_model.layer4_bak.0.bn3.bias', 'module.base_model.layer2_bak.2.bn2.running_var', 'module.base_model.resnext_layer1.0.bn2.weight', 'base_model.layer2_bak.0.conv3.bias', 'base_model.layer4_bak.2.mse.bn3_smallscale4.bias', 'base_model.layer3_bak.5.mse.bn3.bias', 'base_model.layer1_bak.2.conv2.bias', 'base_model.layer4_bak.2.mse.bn3.bias', 'module.base_model.layer3_bak.5.bn3.running_var', 'base_model.layer3_bak.3.bn1.running_mean', 'base_model.layer3_bak.1.mse.bn1.running_var', 'base_model.layer4_bak.0.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer4_bak.1.conv3.bias', 'base_model.resnext_layer1.2.conv3.bias', 'module.base_model.layer2_bak.0.mse.conv3_smallscale2.weight', 'base_model.layer1_bak.0.bn1.num_batches_tracked', 'module.base_model.layer1_bak.0.conv1.bias', 'module.base_model.layer2_bak.3.conv2.bias', 'base_model.layer2_bak.0.mse.bn1.running_var', 'base_model.layer2_bak.1.bn2.weight', 'module.base_model.layer3_bak.3.bn1.bias', 'base_model.layer2_bak.0.bn3.num_batches_tracked', 'base_model.layer3_bak.3.bn3.weight', 'base_model.layer2_bak.1.mse.bn1.weight', 'module.base_model.layer2_bak.1.bn1.num_batches_tracked', 'base_model.resnext_layer1.0.bn2.weight', 'base_model.layer4_bak.0.bn1.num_batches_tracked', 'base_model.layer4_bak.2.conv3.bias', 'module.base_model.layer3_bak.3.mse.bn1.bias', 'module.base_model.layer3_bak.2.conv1.weight', 'base_model.resnext_layer1.1.conv3.weight', 'base_model.layer3_bak.1.bn3.num_batches_tracked', 'module.base_model.layer3_bak.5.bn3.num_batches_tracked', 'base_model.layer3_bak.4.mse.bn3.bias', 'module.base_model.layer3_bak.2.mse.bn1.running_var', 'base_model.layer1_bak.2.bn2.running_mean', 'base_model.layer3_bak.2.bn2.weight', 'base_model.layer3_bak.4.mse.bn3_smallscale2.running_mean', 'base_model.layer3_bak.5.mse.bn3.weight', 'base_model.layer3_bak.1.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.2.conv1.bias', 'base_model.layer3_bak.4.mse.bn3_smallscale4.running_mean', 'base_model.layer3_bak.3.bn1.bias', 'module.base_model.layer4_bak.2.mse.conv3_smallscale2.weight', 'base_model.layer2_bak.0.mse.bn3_smallscale4.weight', 'base_model.layer3_bak.2.bn3.num_batches_tracked', 'base_model.layer4_bak.0.bn3.running_var', 'module.base_model.layer2_bak.2.bn3.running_mean', 'module.base_model.layer4_bak.0.mse.bn3_smallscale4.weight', 'module.base_model.layer3_bak.5.mse.bn3_smallscale4.running_mean', 'base_model.layer3_bak.0.bn3.running_var', 'base_model.layer4_bak.2.mse.conv3_smallscale4.weight', 'module.base_model.layer2_bak.1.mse.bn3_smallscale2.weight', 'base_model.layer3_bak.0.mse.conv1.weight', 'module.base_model.layer2_bak.0.mse.conv3.weight', 'module.base_model.layer4_bak.0.mse.bn3_smallscale2.running_mean', 'module.base_model.layer3_bak.2.mse.conv3_smallscale2.weight', 'base_model.layer3_bak.3.mse.bn3_smallscale2.bias', 'module.base_model.layer3_bak.2.mse.bn1.running_mean', 'base_model.layer3_bak.2.mse.bn1.running_mean', 'module.base_model.layer3_bak.5.mse.bn3.bias', 'module.base_model.resnext_layer1.1.bn1.weight', 'module.base_model.conv1_temp.bias', 'base_model.layer3_bak.2.mse.conv3.weight', 'base_model.layer3_bak.4.bn2.num_batches_tracked', 'module.base_model.resnext_layer1.0.bn1.num_batches_tracked', 'base_model.layer3_bak.3.bn3.running_mean', 'module.base_model.layer4_bak.1.bn3.weight', 'base_model.resnext_layer1.2.bn2.running_var', 'base_model.layer2_bak.0.bn1.running_var', 'base_model.layer2_bak.1.mse.conv3_smallscale2.weight', 'module.base_model.layer4_bak.0.bn3.running_mean', 'base_model.layer3_bak.3.mse.conv3.weight', 'base_model.layer4_bak.0.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer2_bak.3.mse.bn3.running_var', 'base_model.resnext_layer1.2.bn2.running_mean', 'module.base_model.bn1.running_mean', 'module.base_model.layer3_bak.0.bn1.running_mean', 'base_model.layer4_bak.1.bn2.running_mean', 'base_model.layer2_bak.2.mse.bn3.num_batches_tracked', 'base_model.layer1_bak.0.downsample.0.bias', 'base_model.layer2_bak.0.mse.conv1.weight', 'base_model.layer2_bak.1.bn1.num_batches_tracked', 'module.base_model.layer2_bak.1.bn3.weight', 'base_model.layer2_bak.1.mse.bn3.weight', 'base_model.layer4_bak.2.bn2.running_mean', 'base_model.layer1_bak.0.bn1.running_var', 'base_model.layer3_bak.1.conv1.bias', 'base_model.layer1_bak.0.bn3.bias', 'module.base_model.layer2_bak.3.conv2.weight', 'module.base_model.layer1_bak.1.bn1.weight', 'base_model.layer3_bak.3.bn3.running_var', 'base_model.resnext_layer1.0.bn2.bias', 'base_model.layer4_bak.0.mse.bn3_smallscale4.running_mean', 'base_model.layer3_bak.0.downsample.1.bias', 'base_model.layer1_bak.1.bn1.running_var', 'base_model.layer2_bak.2.bn1.num_batches_tracked', 'base_model.layer3_bak.3.bn2.running_var', 'module.base_model.layer2_bak.2.mse.bn1.running_mean', 'module.base_model.layer2_bak.0.mse.bn3_smallscale4.running_mean', 'module.base_model.layer2_bak.0.bn3.running_mean', 'module.base_model.layer3_bak.4.mse.bn3.bias', 'module.base_model.layer2_bak.0.bn2.running_mean', 'base_model.resnext_layer1.2.bn2.weight', 'base_model.layer1_bak.1.conv3.weight', 'module.base_model.layer2_bak.2.mse.bn3_smallscale4.weight', 'module.base_model.layer3_bak.2.mse.conv1.weight', 'module.base_model.layer3_bak.3.mse.bn3.running_mean', 'base_model.layer3_bak.0.bn2.weight', 'module.base_model.resnext_layer1.0.conv1.weight', 'module.base_model.layer3_bak.4.mse.bn3_smallscale2.bias', 'module.base_model.layer4_bak.1.mse.conv2.weight', 'base_model.layer2_bak.1.conv2.bias', 'module.base_model.layer4_bak.0.mse.bn3_smallscale2.num_batches_tracked', 'base_model.resnext_layer1.0.downsample.1.weight', 'base_model.layer3_bak.1.conv3.weight', 'module.base_model.layer2_bak.0.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer2_bak.0.downsample.1.num_batches_tracked', 'base_model.layer2_bak.3.conv3.bias', 'base_model.layer3_bak.0.mse.bn3_smallscale2.weight', 'base_model.layer3_bak.4.mse.bn3_smallscale4.weight', 'module.base_model.layer2_bak.0.mse.bn3.running_mean', 'module.base_model.layer2_bak.2.mse.conv1.weight', 'module.base_model.layer3_bak.4.conv2.bias', 'base_model.layer1_bak.0.bn3.num_batches_tracked', 'module.base_model.layer4_bak.0.bn1.running_mean', 'module.base_model.resnext_layer1.1.bn3.running_var', 'base_model.layer3_bak.0.mse.bn3_smallscale4.weight', 'module.base_model.layer2_bak.2.mse.bn3.running_mean', 'module.base_model.layer1_bak.1.bn2.weight', 'module.base_model.layer2_bak.3.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer4_bak.0.bn2.weight', 'module.base_model.layer3_bak.1.mse.bn3_smallscale2.bias', 'base_model.layer4_bak.0.mse.bn3.num_batches_tracked', 'base_model.layer3_bak.5.bn3.weight', 'base_model.layer3_bak.5.mse.conv3.weight', 'base_model.layer1_bak.2.bn3.bias', 'base_model.resnext_layer1.1.bn3.running_mean', 'base_model.layer3_bak.2.conv2.bias', 'module.base_model.layer1_bak.0.conv3.bias', 'base_model.layer2_bak.0.conv1.weight', 'module.base_model.layer3_bak.2.mse.bn3_smallscale4.bias', 'base_model.layer3_bak.2.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer4_bak.0.mse.bn3_smallscale2.weight', 'module.base_model.layer3_bak.3.bn1.running_var', 'base_model.bn1.running_var', 'module.base_model.layer2_bak.0.conv1.weight', 'base_model.layer3_bak.2.bn1.running_mean', 'base_model.layer4_bak.1.bn1.running_mean', 'module.base_model.layer3_bak.4.mse.conv1.weight', 'module.base_model.layer3_bak.3.bn2.num_batches_tracked', 'base_model.layer1_bak.0.downsample.1.num_batches_tracked', 'base_model.layer3_bak.4.conv3.weight', 'module.base_model.layer2_bak.2.conv2.bias', 'base_model.layer3_bak.2.mse.conv3_smallscale2.weight', 'module.base_model.layer3_bak.1.bn2.running_mean', 'base_model.resnext_layer1.0.bn1.weight', 'base_model.layer3_bak.4.mse.bn1.bias', 'module.base_model.resnext_layer1.2.bn1.weight', 'base_model.layer3_bak.4.bn3.running_var', 'base_model.resnext_layer1.2.bn3.running_mean', 'base_model.layer2_bak.0.mse.bn3.bias', 'module.base_model.layer3_bak.2.bn1.running_mean', 'base_model.layer2_bak.1.bn3.bias', 'base_model.layer3_bak.2.mse.bn3_smallscale2.running_var', 'module.base_model.layer4_bak.0.bn1.num_batches_tracked', 'module.base_model.layer3_bak.5.mse.bn1.bias', 'base_model.layer3_bak.1.mse.conv3.weight', 'base_model.layer2_bak.2.mse.conv3_smallscale4.weight', 'module.base_model.layer4_bak.2.bn1.num_batches_tracked', 'base_model.layer4_bak.0.downsample.1.num_batches_tracked', 'module.base_model.layer3_bak.2.mse.bn3.running_mean', 'base_model.layer2_bak.0.shift.conv.weight', 'module.base_model.layer2_bak.3.bn3.num_batches_tracked', 'module.base_model.layer2_bak.3.mse.bn3.bias', 'module.base_model.layer3_bak.4.bn2.weight', 'base_model.layer3_bak.5.bn2.running_mean', 'module.base_model.layer4_bak.2.conv1.bias', 'module.base_model.resnext_layer1.1.bn1.bias', 'module.base_model.layer3_bak.0.bn3.running_mean', 'base_model.layer4_bak.2.mse.bn1.bias', 'module.base_model.layer3_bak.4.bn1.weight', 'module.base_model.layer2_bak.3.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.0.bn2.running_mean', 'module.base_model.layer3_bak.5.mse.bn3_smallscale2.running_mean', 'module.base_model.layer2_bak.3.bn2.weight', 'module.base_model.layer4_bak.2.mse.bn1.running_var', 'base_model.layer3_bak.1.bn2.running_var', 'base_model.layer3_bak.1.bn1.running_mean', 'base_model.layer3_bak.5.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer4_bak.0.bn3.running_mean', 'base_model.layer4_bak.1.mse.bn3.num_batches_tracked', 'base_model.resnext_layer1.0.downsample.1.running_var', 'base_model.resnext_layer1.2.bn1.weight', 'module.base_model.layer1_bak.1.bn2.bias', 'base_model.layer3_bak.2.mse.conv3_smallscale4.weight', 'module.base_model.layer3_bak.0.mse.bn3_smallscale4.bias', 'base_model.layer2_bak.2.bn2.running_mean', 'base_model.layer3_bak.4.mse.bn1.running_var', 'module.base_model.layer3_bak.2.mse.bn3_smallscale4.weight', 'module.base_model.layer2_bak.0.mse.bn3.num_batches_tracked', 'base_model.layer3_bak.2.mse.bn1.num_batches_tracked', 'module.base_model.layer2_bak.3.bn2.bias', 'module.base_model.resnext_layer1.2.bn1.bias', 'base_model.layer3_bak.1.mse.bn3_smallscale4.running_mean', 'base_model.layer3_bak.1.mse.bn3.num_batches_tracked', 'module.base_model.resnext_layer1.2.bn1.num_batches_tracked', 'module.base_model.layer3_bak.0.conv1.weight', 'module.base_model.layer2_bak.0.mse.bn3_smallscale2.running_mean', 'base_model.resnext_layer1.1.bn2.running_mean', 'module.base_model.layer3_bak.3.conv3.bias', 'base_model.layer3_bak.0.bn2.bias', 'base_model.layer3_bak.2.bn1.running_var', 'module.base_model.layer4_bak.1.bn1.running_var', 'base_model.layer2_bak.0.mse.bn3.num_batches_tracked', 'base_model.layer2_bak.1.bn1.running_var', 'module.base_model.layer3_bak.5.bn3.running_mean', 'module.base_model.layer3_bak.0.bn2.running_var', 'base_model.layer2_bak.1.mse.bn3.bias', 'module.base_model.layer1_bak.2.conv3.weight', 'base_model.layer3_bak.4.mse.bn3.num_batches_tracked', 'module.base_model.layer4_bak.0.downsample.1.running_var', 'module.base_model.layer4_bak.0.downsample.1.weight', 'module.base_model.layer3_bak.0.mse.conv3.weight', 'module.base_model.layer3_bak.5.bn3.bias', 'base_model.layer3_bak.1.mse.bn1.running_mean', 'base_model.layer4_bak.2.conv1.weight', 'base_model.layer2_bak.3.bn1.running_var', 'base_model.layer2_bak.3.mse.bn1.bias', 'module.base_model.layer2_bak.0.conv2.bias', 'module.base_model.layer2_bak.0.mse.bn3_smallscale4.running_var', 'module.base_model.layer4_bak.1.mse.bn3.weight', 'base_model.layer4_bak.2.mse.conv3_smallscale2.weight', 'module.base_model.layer4_bak.2.mse.bn3_smallscale2.running_mean', 'module.base_model.layer2_bak.3.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer2_bak.3.mse.bn1.running_var', 'module.base_model.layer3_bak.5.mse.bn1.num_batches_tracked', 'base_model.resnext_layer1.2.bn3.bias', 'module.base_model.layer2_bak.2.bn1.bias', 'base_model.layer4_bak.0.mse.conv2.weight', 'module.base_model.resnext_layer1.0.bn1.running_var', 'module.base_model.layer1_bak.0.bn1.bias', 'module.base_model.layer4_bak.0.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer3_bak.5.bn1.bias', 'module.base_model.layer4_bak.2.mse.bn1.num_batches_tracked', 'module.base_model.layer2_bak.1.bn2.bias', 'module.base_model.layer4_bak.2.conv2.bias', 'module.base_model.layer3_bak.3.bn2.weight', 'base_model.layer3_bak.2.mse.bn3_smallscale2.bias', 'base_model.layer4_bak.2.bn1.running_var', 'module.base_model.layer2_bak.0.downsample.1.running_mean', 'module.base_model.resnext_layer1.2.bn3.bias', 'base_model.layer3_bak.3.bn3.bias', 'base_model.layer4_bak.0.conv3.bias', 'module.base_model.layer1_bak.0.downsample.0.bias', 'module.base_model.resnext_layer1.2.conv3.bias', 'module.base_model.layer2_bak.0.bn2.num_batches_tracked', 'module.base_model.conv1_5.1.bias', 'module.base_model.resnext_layer1.0.downsample.1.bias', 'module.base_model.layer3_bak.0.mse.bn3_smallscale4.running_var', 'base_model.resnext_layer1.2.bn3.num_batches_tracked', 'base_model.layer2_bak.2.mse.conv3_smallscale2.weight', 'module.base_model.layer2_bak.3.bn3.bias', 'module.base_model.layer2_bak.2.bn2.bias', 'base_model.layer2_bak.2.bn2.running_var', 'base_model.layer1_bak.1.bn1.running_mean', 'base_model.layer3_bak.1.mse.conv3_smallscale2.weight', 'module.base_model.layer4_bak.0.bn1.running_var', 'base_model.layer4_bak.0.downsample.1.bias', 'base_model.layer4_bak.0.mse.bn1.num_batches_tracked', 'base_model.layer1_bak.0.downsample.1.running_var', 'new_fc.bias', 'module.base_model.layer3_bak.5.mse.bn1.weight', 'base_model.layer2_bak.1.mse.bn3_smallscale4.bias', 'module.base_model.layer2_bak.1.mse.bn1.running_mean', 'module.base_model.layer1_bak.1.conv3.bias', 'base_model.layer2_bak.0.mse.conv2.weight', 'base_model.layer3_bak.5.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.3.mse.bn3_smallscale4.running_var', 'base_model.layer4_bak.1.mse.bn3_smallscale2.bias', 'module.base_model.layer3_bak.4.mse.bn3_smallscale4.bias', 'module.base_model.layer3_bak.5.bn2.running_var', 'module.base_model.layer3_bak.2.mse.bn3.running_var', 'base_model.resnext_layer1.1.conv2.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer1_bak.2.bn2.weight', 'base_model.layer4_bak.1.mse.bn3_smallscale4.bias', 'module.base_model.layer2_bak.0.conv3.bias', 'module.base_model.layer3_bak.4.bn2.running_mean', 'module.base_model.layer2_bak.3.bn2.running_var', 'base_model.resnext_layer1.2.bn3.running_var', 'module.base_model.layer2_bak.0.bn1.bias', 'module.base_model.layer2_bak.0.mse.conv1.weight', 'module.base_model.layer3_bak.0.conv2.bias', 'module.base_model.layer3_bak.5.bn3.weight', 'module.base_model.layer2_bak.1.conv1.weight', 'module.base_model.layer1_bak.2.conv2.weight', 'module.base_model.layer2_bak.3.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.4.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.0.shift.conv.weight', 'module.base_model.layer2_bak.3.mse.bn3_smallscale4.bias', 'base_model.layer3_bak.4.bn2.bias', 'base_model.resnext_layer1.2.bn1.running_var', 'module.base_model.layer1_bak.2.bn3.running_var', 'module.base_model.layer4_bak.0.bn3.bias', 'base_model.layer3_bak.5.conv3.weight', 'module.base_model.layer2_bak.2.mse.bn3_smallscale2.weight', 'module.base_model.layer2_bak.3.conv1.weight', 'module.base_model.layer3_bak.2.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer4_bak.0.conv2.bias', 'base_model.layer2_bak.2.mse.bn3_smallscale2.running_var', 'base_model.layer3_bak.2.conv1.bias', 'module.base_model.layer2_bak.3.bn3.running_mean', 'base_model.layer3_bak.5.mse.conv3_smallscale4.weight', 'module.base_model.layer3_bak.0.mse.bn1.running_mean', 'base_model.layer2_bak.3.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer2_bak.2.conv3.weight', 'module.base_model.conv1_5.0.weight', 'base_model.layer2_bak.1.bn3.running_mean', 'module.base_model.layer2_bak.2.mse.bn3.num_batches_tracked', 'base_model.layer2_bak.0.mse.bn3_smallscale2.bias', 'module.base_model.layer3_bak.3.bn2.bias', 'module.base_model.resnext_layer1.2.conv1.bias', 'base_model.layer3_bak.4.bn1.running_var', 'module.base_model.resnext_layer1.1.bn1.num_batches_tracked', 'module.base_model.layer2_bak.1.shift.conv.weight', 'module.base_model.layer3_bak.2.bn1.running_var', 'base_model.layer2_bak.0.bn2.running_mean', 'module.base_model.layer3_bak.1.mse.conv1.weight', 'base_model.layer3_bak.3.mse.conv3_smallscale4.weight', 'module.base_model.layer3_bak.1.mse.bn3.bias', 'base_model.layer3_bak.2.mse.bn3.bias', 'module.base_model.layer3_bak.5.shift.conv.weight', 'module.base_model.layer4_bak.0.downsample.1.num_batches_tracked', 'base_model.layer3_bak.1.bn2.running_mean', 'module.base_model.layer2_bak.3.mse.conv3_smallscale4.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale2.weight', 'module.base_model.layer3_bak.4.mse.bn3_smallscale4.running_var', 'module.base_model.layer1_bak.1.bn3.num_batches_tracked', 'base_model.layer4_bak.0.mse.conv3.weight', 'base_model.layer2_bak.1.bn2.bias', 'module.base_model.layer2_bak.2.bn3.running_var', 'base_model.layer2_bak.3.bn3.bias', 'module.base_model.layer2_bak.2.mse.bn1.bias', 'module.base_model.layer2_bak.3.conv3.bias', 'module.base_model.layer3_bak.2.mse.bn3.weight', 'base_model.layer3_bak.5.bn1.num_batches_tracked', 'base_model.layer2_bak.1.mse.bn3_smallscale4.weight', 'base_model.layer2_bak.1.bn2.num_batches_tracked', 'module.base_model.layer3_bak.5.conv1.bias', 'module.base_model.layer4_bak.2.mse.bn1.running_mean', 'base_model.bn1.num_batches_tracked', 'base_model.layer2_bak.2.conv3.bias', 'base_model.layer2_bak.0.mse.bn3_smallscale4.bias', 'module.base_model.layer2_bak.0.downsample.0.weight', 'module.base_model.layer3_bak.3.shift.conv.weight', 'base_model.layer2_bak.0.mse.conv3_smallscale4.weight', 'module.base_model.resnext_layer1.2.bn2.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.bn3_smallscale4.running_var', 'base_model.resnext_layer1.1.bn1.num_batches_tracked', 'base_model.layer2_bak.1.mse.conv2.weight', 'module.base_model.layer4_bak.1.bn1.bias', 'module.base_model.layer4_bak.0.mse.bn1.running_mean', 'module.base_model.layer3_bak.1.conv1.weight', 'module.base_model.resnext_layer1.1.conv2.bias', 'module.base_model.layer3_bak.2.bn2.running_mean', 'module.base_model.layer4_bak.2.mse.bn3_smallscale4.bias', 'base_model.layer4_bak.0.mse.bn3_smallscale4.running_var', 'base_model.layer2_bak.3.mse.bn1.weight', 'base_model.layer4_bak.1.mse.bn1.weight', 'module.base_model.layer3_bak.2.mse.bn3_smallscale2.running_var', 'base_model.layer2_bak.1.conv3.bias', 'module.base_model.layer3_bak.2.bn3.running_mean', 'module.base_model.layer4_bak.2.mse.bn3_smallscale2.weight', 'module.base_model.layer4_bak.2.conv3.bias', 'module.new_fc.bias', 'base_model.layer2_bak.0.mse.bn3.weight', 'base_model.layer3_bak.4.mse.bn3.running_var', 'base_model.layer3_bak.3.mse.bn3_smallscale4.running_var', 'module.base_model.layer1_bak.2.bn1.weight', 'module.base_model.layer2_bak.0.bn2.bias', 'module.base_model.layer3_bak.5.bn2.weight', 'base_model.layer1_bak.2.conv3.weight', 'base_model.layer4_bak.2.mse.bn3_smallscale4.running_mean', 'base_model.layer2_bak.2.mse.bn1.num_batches_tracked', 'base_model.layer4_bak.1.bn3.running_mean', 'base_model.layer3_bak.1.bn3.running_var', 'base_model.layer2_bak.3.mse.bn3.running_mean', 'module.base_model.layer2_bak.3.conv1.bias', 'module.base_model.layer1_bak.0.downsample.1.bias', 'module.base_model.layer3_bak.5.bn2.bias', 'base_model.layer2_bak.0.bn1.num_batches_tracked', 'module.base_model.layer2_bak.1.bn1.running_var', 'module.base_model.layer4_bak.2.bn2.bias', 'module.base_model.layer3_bak.4.mse.bn1.weight', 'module.base_model.layer2_bak.1.mse.bn3_smallscale4.running_var', 'base_model.layer1_bak.2.bn1.num_batches_tracked', 'base_model.layer2_bak.1.mse.bn3_smallscale2.weight', 'base_model.layer2_bak.3.bn1.running_mean', 'base_model.layer3_bak.2.conv3.weight', 'base_model.layer4_bak.1.bn2.num_batches_tracked', 'base_model.layer4_bak.2.mse.bn1.num_batches_tracked', 'module.base_model.layer3_bak.5.mse.conv3_smallscale4.weight', 'base_model.layer3_bak.3.mse.bn3_smallscale2.running_mean', 'base_model.layer3_bak.0.bn3.bias', 'base_model.layer3_bak.3.mse.bn1.running_var', 'module.base_model.layer4_bak.1.mse.bn3.bias', 'base_model.layer4_bak.0.mse.bn1.running_var', 'base_model.layer3_bak.3.mse.bn3_smallscale2.weight', 'module.base_model.layer2_bak.2.mse.bn1.weight', 'module.base_model.layer3_bak.4.conv3.bias', 'base_model.layer3_bak.4.conv3.bias', 'base_model.layer3_bak.2.mse.bn3.num_batches_tracked', 'base_model.layer3_bak.5.mse.bn1.running_mean', 'base_model.layer4_bak.0.mse.bn3.weight', 'module.base_model.layer1_bak.0.bn3.weight', 'base_model.layer4_bak.1.conv2.weight', 'module.base_model.resnext_layer1.1.conv1.weight', 'module.base_model.layer2_bak.0.bn2.running_var', 'module.base_model.resnext_layer1.1.bn2.num_batches_tracked', 'module.base_model.layer2_bak.2.mse.bn3_smallscale4.num_batches_tracked', 'base_model.layer1_bak.1.conv3.bias', 'base_model.layer3_bak.0.bn2.running_mean', 'base_model.layer2_bak.1.mse.bn3_smallscale2.running_mean', 'module.base_model.layer2_bak.3.mse.bn1.running_mean', 'base_model.layer2_bak.1.mse.bn3_smallscale4.running_mean', 'module.base_model.layer3_bak.0.bn1.weight', 'module.base_model.layer3_bak.4.mse.bn3.running_var', 'base_model.layer4_bak.0.bn2.running_var', 'base_model.layer3_bak.4.mse.bn3_smallscale4.bias', 'base_model.layer1_bak.2.bn3.weight', 'base_model.resnext_layer1.0.bn2.running_var', 'module.base_model.resnext_layer1.2.bn2.weight', 'module.base_model.layer2_bak.0.bn1.weight', 'base_model.layer2_bak.3.conv1.bias', 'base_model.layer3_bak.1.mse.conv3_smallscale4.weight', 'base_model.layer2_bak.3.mse.bn1.running_var', 'module.base_model.layer3_bak.3.mse.bn3_smallscale2.running_var', 'base_model.layer4_bak.2.bn3.bias', 'module.base_model.layer3_bak.5.bn1.num_batches_tracked', 'module.base_model.layer2_bak.0.bn2.weight', 'module.base_model.layer4_bak.1.bn3.running_mean', 'base_model.layer4_bak.2.bn1.bias', 'module.base_model.layer3_bak.1.bn3.num_batches_tracked', 'module.base_model.layer3_bak.4.mse.bn1.bias', 'base_model.layer2_bak.3.mse.bn3_smallscale4.weight', 'base_model.layer3_bak.0.downsample.1.running_var', 'base_model.layer4_bak.0.mse.bn3_smallscale4.weight', 'base_model.conv1_5.0.weight', 'base_model.layer4_bak.0.bn3.num_batches_tracked', 'module.base_model.layer4_bak.2.shift.conv.weight', 'module.base_model.layer3_bak.3.mse.bn1.num_batches_tracked', 'base_model.layer4_bak.2.bn1.weight', 'base_model.layer3_bak.1.mse.bn3_smallscale4.bias', 'module.base_model.layer3_bak.1.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer4_bak.2.bn2.running_var', 'base_model.layer3_bak.1.mse.bn3.bias', 'base_model.layer3_bak.4.mse.bn1.running_mean', 'module.base_model.layer4_bak.2.mse.bn3_smallscale4.running_mean', 'module.base_model.layer2_bak.2.mse.conv3_smallscale4.weight', 'module.base_model.layer3_bak.0.mse.bn1.weight', 'module.base_model.layer3_bak.3.mse.bn3_smallscale2.weight', 'base_model.layer2_bak.3.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.2.mse.conv2.weight', 'module.base_model.layer4_bak.0.bn3.weight', 'base_model.layer4_bak.0.mse.conv3_smallscale2.weight', 'base_model.layer2_bak.0.mse.bn1.weight', 'module.base_model.layer4_bak.1.bn1.num_batches_tracked', 'module.base_model.layer3_bak.0.mse.bn3_smallscale2.running_var', 'module.base_model.layer3_bak.4.mse.bn1.running_mean', 'module.base_model.layer3_bak.1.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer2_bak.0.bn3.bias', 'base_model.layer4_bak.1.mse.bn1.running_var', 'base_model.resnext_layer1.1.bn2.bias', 'base_model.resnext_layer1.1.bn3.running_var', 'module.base_model.layer3_bak.0.mse.bn3.running_var', 'module.base_model.layer3_bak.2.mse.bn3_smallscale2.bias', 'module.base_model.layer2_bak.0.mse.bn1.running_mean', 'base_model.layer4_bak.2.bn3.weight', 'module.base_model.layer3_bak.4.conv1.bias', 'base_model.layer4_bak.0.mse.conv3_smallscale4.weight', 'base_model.layer3_bak.0.bn1.bias', 'module.base_model.layer3_bak.3.mse.bn3.weight', 'base_model.layer2_bak.0.mse.bn1.bias', 'module.base_model.layer2_bak.1.bn2.running_var', 'module.base_model.layer4_bak.0.bn1.weight', 'base_model.layer2_bak.1.mse.conv3.weight', 'module.base_model.layer3_bak.5.mse.conv2.weight', 'base_model.layer4_bak.2.bn1.running_mean', 'module.base_model.layer4_bak.1.conv2.weight', 'base_model.layer1_bak.1.bn1.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.conv3_smallscale4.weight', 'base_model.layer2_bak.0.mse.bn3.running_var', 'module.base_model.layer3_bak.5.mse.bn3_smallscale4.running_var', 'module.base_model.layer3_bak.1.mse.bn3_smallscale4.running_mean', 'base_model.layer2_bak.1.mse.conv3_smallscale4.weight', 'module.base_model.layer4_bak.1.conv3.bias', 'module.base_model.layer3_bak.1.bn3.weight', 'module.base_model.layer3_bak.2.bn1.weight', 'module.base_model.layer1_bak.1.bn2.num_batches_tracked', 'module.base_model.conv1.weight', 'module.base_model.layer3_bak.4.mse.bn3.num_batches_tracked', 'module.base_model.layer3_bak.0.bn1.bias', 'module.base_model.layer4_bak.0.mse.conv3_smallscale2.weight', 'module.base_model.layer4_bak.2.mse.bn1.weight', 'module.base_model.layer2_bak.3.mse.bn3.num_batches_tracked', 'base_model.layer3_bak.0.conv2.bias', 'module.base_model.layer3_bak.2.mse.bn3.bias', 'base_model.resnext_layer1.2.bn1.num_batches_tracked', 'base_model.layer3_bak.4.conv2.bias', 'base_model.layer1_bak.2.bn2.weight', 'base_model.layer4_bak.2.mse.bn1.running_mean', 'module.base_model.layer4_bak.1.mse.bn3_smallscale2.running_var', 'base_model.conv1_5.1.running_var', 'base_model.layer4_bak.1.conv1.bias', 'module.base_model.layer4_bak.2.mse.bn3.running_mean', 'base_model.layer2_bak.0.downsample.1.running_var', 'module.base_model.layer3_bak.4.mse.bn3_smallscale2.running_var', 'base_model.layer4_bak.1.bn3.num_batches_tracked', 'module.base_model.layer3_bak.1.mse.bn3_smallscale4.bias', 'base_model.layer2_bak.3.bn1.bias', 'base_model.layer1_bak.0.bn1.bias', 'base_model.resnext_layer1.2.bn1.bias', 'base_model.layer2_bak.2.mse.conv1.weight', 'base_model.layer2_bak.0.conv2.weight', 'module.base_model.layer3_bak.5.mse.bn3_smallscale2.weight', 'module.base_model.layer1_bak.0.bn2.running_var', 'module.base_model.layer2_bak.0.mse.bn3_smallscale4.weight', 'module.base_model.layer3_bak.1.bn2.num_batches_tracked', 'base_model.layer2_bak.1.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer3_bak.4.bn3.weight', 'module.base_model.layer3_bak.3.mse.bn1.running_var', 'base_model.layer1_bak.1.conv1.weight', 'base_model.layer1_bak.1.conv2.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale4.bias', 'base_model.layer3_bak.2.mse.bn3_smallscale2.running_mean', 'module.base_model.layer3_bak.0.downsample.1.bias', 'module.base_model.layer3_bak.0.mse.bn3.bias', 'module.base_model.conv1_5.1.running_mean', 'base_model.layer4_bak.0.bn2.weight', 'base_model.layer4_bak.1.mse.bn3_smallscale4.running_var', 'base_model.layer2_bak.0.mse.bn3_smallscale4.num_batches_tracked', 'module.base_model.layer2_bak.0.bn3.running_var', 'base_model.layer3_bak.4.mse.bn1.num_batches_tracked', 'base_model.layer2_bak.1.conv3.weight', 'module.base_model.layer2_bak.0.bn3.num_batches_tracked', 'base_model.layer3_bak.4.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer2_bak.2.mse.bn3_smallscale4.running_var', 'module.base_model.layer3_bak.3.mse.bn3.bias', 'base_model.layer3_bak.4.bn1.weight', 'module.base_model.layer3_bak.5.mse.bn3_smallscale2.bias', 'base_model.layer4_bak.0.bn2.num_batches_tracked', 'base_model.resnext_layer1.0.bn2.running_mean', 'base_model.layer2_bak.2.bn1.running_var', 'base_model.layer3_bak.5.conv2.weight', 'base_model.layer2_bak.2.mse.bn3_smallscale4.weight', 'module.base_model.resnext_layer1.0.bn2.running_mean', 'module.base_model.layer2_bak.1.mse.bn3_smallscale4.weight', 'module.base_model.layer4_bak.1.mse.bn1.running_var', 'module.base_model.layer2_bak.3.bn2.running_mean', 'module.base_model.layer3_bak.3.mse.conv3_smallscale4.weight', 'module.base_model.resnext_layer1.2.bn2.bias', 'module.base_model.resnext_layer1.0.bn3.running_mean', 'base_model.layer3_bak.5.bn1.running_mean', 'base_model.layer2_bak.2.bn1.running_mean', 'base_model.layer3_bak.0.mse.bn1.weight', 'base_model.layer3_bak.0.downsample.1.weight', 'base_model.layer2_bak.2.bn3.bias', 'base_model.layer3_bak.4.mse.bn3.running_mean', 'module.base_model.layer2_bak.3.shift.conv.weight', 'module.base_model.layer4_bak.1.bn2.bias', 'module.base_model.layer3_bak.1.bn2.bias', 'module.base_model.layer3_bak.5.mse.bn3_smallscale2.num_batches_tracked', 'base_model.layer1_bak.0.bn3.running_var', 'module.base_model.layer3_bak.4.mse.bn3_smallscale4.running_mean', 'module.base_model.layer1_bak.0.bn3.bias', 'module.base_model.layer1_bak.2.bn3.weight', 'new_fc.weight', 'module.base_model.layer2_bak.2.mse.conv2.weight', 'module.base_model.layer2_bak.2.bn3.bias', 'module.base_model.layer3_bak.1.bn2.running_var', 'base_model.layer2_bak.2.bn2.num_batches_tracked', 'base_model.layer3_bak.2.bn3.running_mean', 'module.base_model.resnext_layer1.0.bn2.num_batches_tracked', 'module.base_model.layer1_bak.1.conv3.weight', 'module.base_model.layer4_bak.1.mse.bn3_smallscale4.running_mean', 'module.base_model.layer3_bak.2.bn3.bias', 'base_model.layer4_bak.1.shift.conv.weight', 'base_model.layer3_bak.2.bn3.bias', 'module.base_model.resnext_layer1.2.conv3.weight', 'module.base_model.layer2_bak.2.mse.bn3_smallscale4.bias', 'module.base_model.layer3_bak.1.bn3.bias', 'module.base_model.resnext_layer1.1.conv3.weight', 'base_model.layer4_bak.1.mse.bn1.num_batches_tracked', 'base_model.layer4_bak.0.conv3.weight', 'base_model.layer4_bak.2.mse.conv1.weight', 'module.base_model.layer3_bak.4.bn2.running_var', 'module.base_model.layer3_bak.0.mse.bn1.num_batches_tracked', 'base_model.layer3_bak.0.bn1.num_batches_tracked', 'base_model.layer1_bak.1.bn2.num_batches_tracked', 'module.base_model.layer1_bak.1.bn2.running_mean', 'base_model.layer2_bak.0.downsample.1.running_mean', 'module.base_model.layer1_bak.1.bn3.weight', 'base_model.conv1_5.1.weight', 'base_model.layer2_bak.2.mse.bn3_smallscale2.running_mean', 'module.base_model.layer4_bak.2.mse.conv3_smallscale4.weight', 'base_model.resnext_layer1.0.bn1.running_mean', 'module.base_model.resnext_layer1.0.conv3.weight', 'module.base_model.layer2_bak.2.mse.bn3.running_var', 'base_model.layer3_bak.3.mse.bn3_smallscale4.bias', 'base_model.layer2_bak.3.mse.bn3.weight', 'module.base_model.layer3_bak.4.mse.bn3.running_mean', 'base_model.layer1_bak.1.conv1.bias', 'module.base_model.layer1_bak.1.bn3.running_mean', 'module.base_model.layer4_bak.0.mse.bn3_smallscale2.running_var', 'base_model.layer3_bak.2.bn2.running_mean', 'module.base_model.resnext_layer1.1.bn3.running_mean', 'base_model.layer2_bak.2.mse.conv2.weight', 'base_model.layer3_bak.5.mse.bn3_smallscale2.num_batches_tracked', 'module.base_model.layer2_bak.0.downsample.1.weight', 'module.base_model.layer4_bak.0.conv3.weight', 'module.base_model.layer1_bak.0.bn1.weight', 'module.base_model.layer3_bak.4.mse.bn3.weight', 'module.base_model.layer4_bak.0.bn2.bias', 'module.base_model.layer3_bak.4.bn3.num_batches_tracked', 'base_model.layer3_bak.4.mse.bn3_smallscale2.running_var', 'base_model.layer3_bak.3.mse.bn3.running_mean', 'base_model.resnext_layer1.2.conv2.weight', 'base_model.layer4_bak.2.bn3.running_var', 'base_model.layer3_bak.3.shift.conv.weight', 'base_model.layer2_bak.2.bn3.running_var', 'module.base_model.layer4_bak.2.mse.bn3_smallscale4.weight', 'module.base_model.bn1.running_var', 'module.base_model.layer4_bak.2.mse.bn3.bias', 'module.base_model.layer3_bak.3.conv1.weight', 'base_model.layer2_bak.3.conv1.weight', 'module.base_model.layer3_bak.2.mse.bn3_smallscale2.running_mean', 'module.base_model.layer2_bak.2.bn1.num_batches_tracked', 'module.base_model.layer2_bak.1.bn1.running_mean', 'base_model.layer4_bak.1.mse.bn1.bias', 'base_model.conv1.weight', 'module.base_model.layer3_bak.3.bn1.running_mean', 'base_model.layer2_bak.2.mse.bn3_smallscale4.bias', 'module.base_model.layer2_bak.0.shift.conv.weight', 'module.base_model.bn1.weight', 'module.base_model.layer1_bak.0.downsample.0.weight', 'module.base_model.layer2_bak.2.mse.bn1.num_batches_tracked', 'base_model.layer3_bak.3.mse.bn3_smallscale2.num_batches_tracked', 'base_model.resnext_layer1.1.conv2.bias', 'module.base_model.layer1_bak.1.bn2.running_var', 'base_model.layer3_bak.2.mse.bn3_smallscale4.running_var', 'base_model.layer4_bak.1.conv2.bias', 'base_model.bn1.running_mean', 'base_model.resnext_layer1.0.conv1.bias', 'base_model.layer2_bak.0.mse.bn3_smallscale2.running_var', 'base_model.layer4_bak.0.downsample.0.weight', 'module.base_model.resnext_layer1.2.bn2.running_mean', 'base_model.layer3_bak.2.bn3.running_var', 'base_model.layer3_bak.2.bn1.bias', 'module.base_model.layer4_bak.0.mse.bn3_smallscale4.bias', 'base_model.layer3_bak.2.mse.bn1.bias', 'base_model.resnext_layer1.0.conv2.weight', 'base_model.layer4_bak.0.shift.conv.weight', 'module.base_model.layer2_bak.0.downsample.1.running_var', 'base_model.layer2_bak.1.mse.bn3_smallscale4.running_var', 'base_model.layer3_bak.3.mse.bn1.running_mean', ...........................

sanwei111 commented 1 year ago

after i changed code: model.load_state_dict(model_dict),becomes:

RuntimeError: Error(s) in loading state_dict for DistributedDataParallel: Unexpected key(s) in state_dict: "base_model.conv1.weight", "base_model.conv1.bias", "base_model.bn1.weight", "base_model.bn1.bias", "base_model.bn1.running_mean", "base_model.bn1.running_var", "base_model.bn1.num_batches_tracked", "base_model.conv1_temp.weight", "base_model.conv1_temp.bias", "base_model.conv1_5.0.weight", "base_model.conv1_5.1.weight", "b

sanwei111 commented 1 year ago

download 2 model from Kinetics400

huiyang-rgb commented 1 year ago

after i changed code: model.load_state_dict(model_dict),becomes:

RuntimeError: Error(s) in loading state_dict for DistributedDataParallel: Unexpected key(s) in state_dict: "base_model.conv1.weight", "base_model.conv1.bias", "base_model.bn1.weight", "base_model.bn1.bias", "base_model.bn1.running_mean", "base_model.bn1.running_var", "base_model.bn1.num_batches_tracked", "base_model.conv1_temp.weight", "base_model.conv1_temp.bias", "base_model.conv1_5.0.weight", "base_model.conv1_5.1.weight", "b

How do you solve it? I encountered the same problem.