feinanshan / TDNet

Temporally Distributed Networks for Fast Video Semantic Segmentation
http://cs-people.bu.edu/pinghu/TDNet
MIT License
201 stars 45 forks source link

train and validate and test #20

Closed 872544050 closed 4 years ago

872544050 commented 4 years ago

Dear author, for the same cityscape dataset, my mIOU in the training phase can reach more than 0.8, but in the test and verification phase, the mIOU of this weight is only 0.009. Which one of my settings is wrong? I've been training many times and I've had this problem all the time.

feinanshan commented 4 years ago

Hi, can you check whether the trained model in loaded properly by changing the ''False'' into ''True'' in https://github.com/feinanshan/TDNet/blob/master/Training/validate.py#L53

872544050 commented 4 years ago

I have read your code carefully, and I understand almost all the parameters that can be changed. If I change this parameter to true, the following error will occur: Traceback (most recent call last): File "/mnt/guhuayu/TDNetxin/Training/validate.py", line 153, in validate(cfg, args) File "/mnt/guhuayu/TDNetxin/Training/validate.py", line 53, in validate model.load_state_dict(state, strict=True) File "/home/usv/anaconda3/envs/TDnet1/lib/python3.7/site-packages/torch/nn/modules/module.py", line 777, in load_state_dict self.class.name, "\n\t".join(error_msgs))) RuntimeError: Error(s) in loading state_dict for td4_psp: Missing key(s) in state_dict: "pretrained1.conv1.weight", "pretrained1.bn1.weight", "pretrained1.bn1.bias", "pretrained1.bn1.running_mean", "pretrained1.bn1.running_var", "pretrained1.layer1.0.conv1.weight", "pretrained1.layer1.0.bn1.weight", "pretrained1.layer1.0.bn1.bias", "pretrained1.layer1.0.bn1.running_mean", "pretrained1.layer1.0.bn1.running_var", "pretrained1.layer1.0.conv2.weight", "pretrained1.layer1.0.bn2.weight", "pretrained1.layer1.0.bn2.bias", "pretrained1.layer1.0.bn2.running_mean", "pretrained1.layer1.0.bn2.running_var", "pretrained1.layer1.1.conv1.weight", "pretrained1.layer1.1.bn1.weight", "pretrained1.layer1.1.bn1.bias", "pretrained1.layer1.1.bn1.running_mean", "pretrained1.layer1.1.bn1.running_var", "pretrained1.layer1.1.conv2.weight", "pretrained1.layer1.1.bn2.weight", "pretrained1.layer1.1.bn2.bias", "pretrained1.layer1.1.bn2.running_mean", "pretrained1.layer1.1.bn2.running_var", "pretrained1.layer2.0.conv1.weight", "pretrained1.layer2.0.bn1.weight", "pretrained1.layer2.0.bn1.bias", "pretrained1.layer2.0.bn1.running_mean", "pretrained1.layer2.0.bn1.running_var", "pretrained1.layer2.0.conv2.weight", "pretrained1.layer2.0.bn2.weight", "pretrained1.layer2.0.bn2.bias", "pretrained1.layer2.0.bn2.running_mean", "pretrained1.layer2.0.bn2.running_var", "pretrained1.layer2.0.downsample.0.weight", "pretrained1.layer2.0.downsample.1.weight", "pretrained1.layer2.0.downsample.1.bias", "pretrained1.layer2.0.downsample.1.running_mean", "pretrained1.layer2.0.downsample.1.running_var", "pretrained1.layer2.1.conv1.weight", "pretrained1.layer2.1.bn1.weight", "pretrained1.layer2.1.bn1.bias", "pretrained1.layer2.1.bn1.running_mean", "pretrained1.layer2.1.bn1.running_var", "pretrained1.layer2.1.conv2.weight", "pretrained1.layer2.1.bn2.weight", "pretrained1.layer2.1.bn2.bias", "pretrained1.layer2.1.bn2.running_mean", "pretrained1.layer2.1.bn2.running_var", "pretrained1.layer3.0.conv1.weight", "pretrained1.layer3.0.bn1.weight", "pretrained1.layer3.0.bn1.bias", "pretrained1.layer3.0.bn1.running_mean", "pretrained1.layer3.0.bn1.running_var", "pretrained1.layer3.0.conv2.weight", "pretrained1.layer3.0.bn2.weight", "pretrained1.layer3.0.bn2.bias", "pretrained1.layer3.0.bn2.running_mean", "pretrained1.layer3.0.bn2.running_var", "pretrained1.layer3.0.downsample.0.weight", "pretrained1.layer3.0.downsample.1.weight", "pretrained1.layer3.0.downsample.1.bias", "pretrained1.layer3.0.downsample.1.running_mean", "pretrained1.layer3.0.downsample.1.running_var", "pretrained1.layer3.1.conv1.weight", "pretrained1.layer3.1.bn1.weight", "pretrained1.layer3.1.bn1.bias", "pretrained1.layer3.1.bn1.running_mean", "pretrained1.layer3.1.bn1.running_var", "pretrained1.layer3.1.conv2.weight", "pretrained1.layer3.1.bn2.weight", "pretrained1.layer3.1.bn2.bias", "pretrained1.layer3.1.bn2.running_mean", "pretrained1.layer3.1.bn2.running_var", "pretrained1.layer4.0.conv1.weight", "pretrained1.layer4.0.bn1.weight", "pretrained1.layer4.0.bn1.bias", "pretrained1.layer4.0.bn1.running_mean", "pretrained1.layer4.0.bn1.running_var", "pretrained1.layer4.0.conv2.weight", "pretrained1.layer4.0.bn2.weight", "pretrained1.layer4.0.bn2.bias", "pretrained1.layer4.0.bn2.running_mean", "pretrained1.layer4.0.bn2.running_var", "pretrained1.layer4.0.downsample.0.weight", "pretrained1.layer4.0.downsample.1.weight", "pretrained1.layer4.0.downsample.1.bias", "pretrained1.layer4.0.downsample.1.running_mean", "pretrained1.layer4.0.downsample.1.running_var", "pretrained1.layer4.1.conv1.weight", "pretrained1.layer4.1.bn1.weight", "pretrained1.layer4.1.bn1.bias", "pretrained1.layer4.1.bn1.running_mean", "pretrained1.layer4.1.bn1.running_var", "pretrained1.layer4.1.conv2.weight", "pretrained1.layer4.1.bn2.weight", "pretrained1.layer4.1.bn2.bias", "pretrained1.layer4.1.bn2.running_mean", "pretrained1.layer4.1.bn2.running_var", "pretrained1.fc.weight", "pretrained1.fc.bias", "pretrained2.conv1.weight", "pretrained2.bn1.weight", "pretrained2.bn1.bias", "pretrained2.bn1.running_mean", "pretrained2.bn1.running_var", "pretrained2.layer1.0.conv1.weight", "pretrained2.layer1.0.bn1.weight", "pretrained2.layer1.0.bn1.bias", "pretrained2.layer1.0.bn1.running_mean", "pretrained2.layer1.0.bn1.running_var", "pretrained2.layer1.0.conv2.weight", "pretrained2.layer1.0.bn2.weight", "pretrained2.layer1.0.bn2.bias", "pretrained2.layer1.0.bn2.running_mean", "pretrained2.layer1.0.bn2.running_var", "pretrained2.layer1.1.conv1.weight", "pretrained2.layer1.1.bn1.weight", "pretrained2.layer1.1.bn1.bias", "pretrained2.layer1.1.bn1.running_mean", "pretrained2.layer1.1.bn1.running_var", "pretrained2.layer1.1.conv2.weight", "pretrained2.layer1.1.bn2.weight", "pretrained2.layer1.1.bn2.bias", "pretrained2.layer1.1.bn2.running_mean", "pretrained2.layer1.1.bn2.running_var", "pretrained2.layer2.0.conv1.weight", "pretrained2.layer2.0.bn1.weight", "pretrained2.layer2.0.bn1.bias", "pretrained2.layer2.0.bn1.running_mean", "pretrained2.layer2.0.bn1.running_var", "pretrained2.layer2.0.conv2.weight", "pretrained2.layer2.0.bn2.weight", "pretrained2.layer2.0.bn2.bias", "pretrained2.layer2.0.bn2.running_mean", "pretrained2.layer2.0.bn2.running_var", "pretrained2.layer2.0.downsample.0.weight", "pretrained2.layer2.0.downsample.1.weight", "pretrained2.layer2.0.downsample.1.bias", "pretrained2.layer2.0.downsample.1.running_mean", "pretrained2.layer2.0.downsample.1.running_var", "pretrained2.layer2.1.conv1.weight", "pretrained2.layer2.1.bn1.weight", "pretrained2.layer2.1.bn1.bias", "pretrained2.layer2.1.bn1.running_mean", "pretrained2.layer2.1.bn1.running_var", "pretrained2.layer2.1.conv2.weight", "pretrained2.layer2.1.bn2.weight", "pretrained2.layer2.1.bn2.bias", "pretrained2.layer2.1.bn2.running_mean", "pretrained2.layer2.1.bn2.running_var", "pretrained2.layer3.0.conv1.weight", "pretrained2.layer3.0.bn1.weight", "pretrained2.layer3.0.bn1.bias", "pretrained2.layer3.0.bn1.running_mean", "pretrained2.layer3.0.bn1.running_var", "pretrained2.layer3.0.conv2.weight", "pretrained2.layer3.0.bn2.weight", "pretrained2.layer3.0.bn2.bias", "pretrained2.layer3.0.bn2.running_mean", "pretrained2.layer3.0.bn2.running_var", "pretrained2.layer3.0.downsample.0.weight", "pretrained2.layer3.0.downsample.1.weight", "pretrained2.layer3.0.downsample.1.bias", "pretrained2.layer3.0.downsample.1.running_mean", "pretrained2.layer3.0.downsample.1.running_var", "pretrained2.layer3.1.conv1.weight", "pretrained2.layer3.1.bn1.weight", "pretrained2.layer3.1.bn1.bias", "pretrained2.layer3.1.bn1.running_mean", "pretrained2.layer3.1.bn1.running_var", "pretrained2.layer3.1.conv2.weight", "pretrained2.layer3.1.bn2.weight", "pretrained2.layer3.1.bn2.bias", "pretrained2.layer3.1.bn2.running_mean", "pretrained2.layer3.1.bn2.running_var", "pretrained2.layer4.0.conv1.weight", "pretrained2.layer4.0.bn1.weight", "pretrained2.layer4.0.bn1.bias", "pretrained2.layer4.0.bn1.running_mean", "pretrained2.layer4.0.bn1.running_var", "pretrained2.layer4.0.conv2.weight", "pretrained2.layer4.0.bn2.weight", "pretrained2.layer4.0.bn2.bias", "pretrained2.layer4.0.bn2.running_mean", "pretrained2.layer4.0.bn2.running_var", "pretrained2.layer4.0.downsample.0.weight", "pretrained2.layer4.0.downsample.1.weight", "pretrained2.layer4.0.downsample.1.bias", "pretrained2.layer4.0.downsample.1.running_mean", "pretrained2.layer4.0.downsample.1.running_var", "pretrained2.layer4.1.conv1.weight", "pretrained2.layer4.1.bn1.weight", "pretrained2.layer4.1.bn1.bias", "pretrained2.layer4.1.bn1.running_mean", "pretrained2.layer4.1.bn1.running_var", "pretrained2.layer4.1.conv2.weight", "pretrained2.layer4.1.bn2.weight", "pretrained2.layer4.1.bn2.bias", "pretrained2.layer4.1.bn2.running_mean", "pretrained2.layer4.1.bn2.running_var", "pretrained2.fc.weight", "pretrained2.fc.bias", "pretrained3.conv1.weight", "pretrained3.bn1.weight", "pretrained3.bn1.bias", "pretrained3.bn1.running_mean", "pretrained3.bn1.running_var", "pretrained3.layer1.0.conv1.weight", "pretrained3.layer1.0.bn1.weight", "pretrained3.layer1.0.bn1.bias", "pretrained3.layer1.0.bn1.running_mean", "pretrained3.layer1.0.bn1.running_var", "pretrained3.layer1.0.conv2.weight", "pretrained3.layer1.0.bn2.weight", "pretrained3.layer1.0.bn2.bias", "pretrained3.layer1.0.bn2.running_mean", "pretrained3.layer1.0.bn2.running_var", "pretrained3.layer1.1.conv1.weight", "pretrained3.layer1.1.bn1.weight", "pretrained3.layer1.1.bn1.bias", "pretrained3.layer1.1.bn1.running_mean", "pretrained3.layer1.1.bn1.running_var", "pretrained3.layer1.1.conv2.weight", "pretrained3.layer1.1.bn2.weight", "pretrained3.layer1.1.bn2.bias", "pretrained3.layer1.1.bn2.running_mean", "pretrained3.layer1.1.bn2.running_var", "pretrained3.layer2.0.conv1.weight", "pretrained3.layer2.0.bn1.weight", "pretrained3.layer2.0.bn1.bias", "pretrained3.layer2.0.bn1.running_mean", "pretrained3.layer2.0.bn1.running_var", "pretrained3.layer2.0.conv2.weight", "pretrained3.layer2.0.bn2.weight", "pretrained3.layer2.0.bn2.bias", "pretrained3.layer2.0.bn2.running_mean", "pretrained3.layer2.0.bn2.running_var", "pretrained3.layer2.0.downsample.0.weight", "pretrained3.layer2.0.downsample.1.weight", "pretrained3.layer2.0.downsample.1.bias", "pretrained3.layer2.0.downsample.1.running_mean", "pretrained3.layer2.0.downsample.1.running_var", "pretrained3.layer2.1.conv1.weight", "pretrained3.layer2.1.bn1.weight", "pretrained3.layer2.1.bn1.bias", "pretrained3.layer2.1.bn1.running_mean", "pretrained3.layer2.1.bn1.running_var", "pretrained3.layer2.1.conv2.weight", "pretrained3.layer2.1.bn2.weight", "pretrained3.layer2.1.bn2.bias", "pretrained3.layer2.1.bn2.running_mean", "pretrained3.layer2.1.bn2.running_var", "pretrained3.layer3.0.conv1.weight", "pretrained3.layer3.0.bn1.weight", "pretrained3.layer3.0.bn1.bias", "pretrained3.layer3.0.bn1.running_mean", "pretrained3.layer3.0.bn1.running_var", "pretrained3.layer3.0.conv2.weight", "pretrained3.layer3.0.bn2.weight", "pretrained3.layer3.0.bn2.bias", "pretrained3.layer3.0.bn2.running_mean", "pretrained3.layer3.0.bn2.running_var", "pretrained3.layer3.0.downsample.0.weight", "pretrained3.layer3.0.downsample.1.weight", "pretrained3.layer3.0.downsample.1.bias", "pretrained3.layer3.0.downsample.1.running_mean", "pretrained3.layer3.0.downsample.1.running_var", "pretrained3.layer3.1.conv1.weight", "pretrained3.layer3.1.bn1.weight", "pretrained3.layer3.1.bn1.bias", "pretrained3.layer3.1.bn1.running_mean", "pretrained3.layer3.1.bn1.running_var", "pretrained3.layer3.1.conv2.weight", "pretrained3.layer3.1.bn2.weight", "pretrained3.layer3.1.bn2.bias", "pretrained3.layer3.1.bn2.running_mean", "pretrained3.layer3.1.bn2.running_var", "pretrained3.layer4.0.conv1.weight", "pretrained3.layer4.0.bn1.weight", "pretrained3.layer4.0.bn1.bias", "pretrained3.layer4.0.bn1.running_mean", "pretrained3.layer4.0.bn1.running_var", "pretrained3.layer4.0.conv2.weight", "pretrained3.layer4.0.bn2.weight", "pretrained3.layer4.0.bn2.bias", "pretrained3.layer4.0.bn2.running_mean", "pretrained3.layer4.0.bn2.running_var", "pretrained3.layer4.0.downsample.0.weight", "pretrained3.layer4.0.downsample.1.weight", "pretrained3.layer4.0.downsample.1.bias", "pretrained3.layer4.0.downsample.1.running_mean", "pretrained3.layer4.0.downsample.1.running_var", "pretrained3.layer4.1.conv1.weight", "pretrained3.layer4.1.bn1.weight", "pretrained3.layer4.1.bn1.bias", "pretrained3.layer4.1.bn1.running_mean", "pretrained3.layer4.1.bn1.running_var", "pretrained3.layer4.1.conv2.weight", "pretrained3.layer4.1.bn2.weight", "pretrained3.layer4.1.bn2.bias", "pretrained3.layer4.1.bn2.running_mean", "pretrained3.layer4.1.bn2.running_var", "pretrained3.fc.weight", "pretrained3.fc.bias", "pretrained4.conv1.weight", "pretrained4.bn1.weight", "pretrained4.bn1.bias", "pretrained4.bn1.running_mean", "pretrained4.bn1.running_var", "pretrained4.layer1.0.conv1.weight", "pretrained4.layer1.0.bn1.weight", "pretrained4.layer1.0.bn1.bias", "pretrained4.layer1.0.bn1.running_mean", "pretrained4.layer1.0.bn1.running_var", "pretrained4.layer1.0.conv2.weight", "pretrained4.layer1.0.bn2.weight", "pretrained4.layer1.0.bn2.bias", "pretrained4.layer1.0.bn2.running_mean", "pretrained4.layer1.0.bn2.running_var", "pretrained4.layer1.1.conv1.weight", "pretrained4.layer1.1.bn1.weight", "pretrained4.layer1.1.bn1.bias", "pretrained4.layer1.1.bn1.running_mean", "pretrained4.layer1.1.bn1.running_var", "pretrained4.layer1.1.conv2.weight", "pretrained4.layer1.1.bn2.weight", "pretrained4.layer1.1.bn2.bias", "pretrained4.layer1.1.bn2.running_mean", "pretrained4.layer1.1.bn2.running_var", "pretrained4.layer2.0.conv1.weight", "pretrained4.layer2.0.bn1.weight", "pretrained4.layer2.0.bn1.bias", "pretrained4.layer2.0.bn1.running_mean", "pretrained4.layer2.0.bn1.running_var", "pretrained4.layer2.0.conv2.weight", "pretrained4.layer2.0.bn2.weight", "pretrained4.layer2.0.bn2.bias", "pretrained4.layer2.0.bn2.running_mean", "pretrained4.layer2.0.bn2.running_var", "pretrained4.layer2.0.downsample.0.weight", "pretrained4.layer2.0.downsample.1.weight", "pretrained4.layer2.0.downsample.1.bias", "pretrained4.layer2.0.downsample.1.running_mean", "pretrained4.layer2.0.downsample.1.running_var", "pretrained4.layer2.1.conv1.weight", "pretrained4.layer2.1.bn1.weight", "pretrained4.layer2.1.bn1.bias", "pretrained4.layer2.1.bn1.running_mean", "pretrained4.layer2.1.bn1.running_var", "pretrained4.layer2.1.conv2.weight", "pretrained4.layer2.1.bn2.weight", "pretrained4.layer2.1.bn2.bias", "pretrained4.layer2.1.bn2.running_mean", "pretrained4.layer2.1.bn2.running_var", "pretrained4.layer3.0.conv1.weight", "pretrained4.layer3.0.bn1.weight", "pretrained4.layer3.0.bn1.bias", "pretrained4.layer3.0.bn1.running_mean", "pretrained4.layer3.0.bn1.running_var", "pretrained4.layer3.0.conv2.weight", "pretrained4.layer3.0.bn2.weight", "pretrained4.layer3.0.bn2.bias", "pretrained4.layer3.0.bn2.running_mean", "pretrained4.layer3.0.bn2.running_var", "pretrained4.layer3.0.downsample.0.weight", "pretrained4.layer3.0.downsample.1.weight", "pretrained4.layer3.0.downsample.1.bias", "pretrained4.layer3.0.downsample.1.running_mean", "pretrained4.layer3.0.downsample.1.running_var", "pretrained4.layer3.1.conv1.weight", "pretrained4.layer3.1.bn1.weight", "pretrained4.layer3.1.bn1.bias", "pretrained4.layer3.1.bn1.running_mean", "pretrained4.layer3.1.bn1.running_var", "pretrained4.layer3.1.conv2.weight", "pretrained4.layer3.1.bn2.weight", "pretrained4.layer3.1.bn2.bias", "pretrained4.layer3.1.bn2.running_mean", "pretrained4.layer3.1.bn2.running_var", "pretrained4.layer4.0.conv1.weight", "pretrained4.layer4.0.bn1.weight", "pretrained4.layer4.0.bn1.bias", "pretrained4.layer4.0.bn1.running_mean", "pretrained4.layer4.0.bn1.running_var", "pretrained4.layer4.0.conv2.weight", "pretrained4.layer4.0.bn2.weight", "pretrained4.layer4.0.bn2.bias", "pretrained4.layer4.0.bn2.running_mean", "pretrained4.layer4.0.bn2.running_var", "pretrained4.layer4.0.downsample.0.weight", "pretrained4.layer4.0.downsample.1.weight", "pretrained4.layer4.0.downsample.1.bias", "pretrained4.layer4.0.downsample.1.running_mean", "pretrained4.layer4.0.downsample.1.running_var", "pretrained4.layer4.1.conv1.weight", "pretrained4.layer4.1.bn1.weight", "pretrained4.layer4.1.bn1.bias", "pretrained4.layer4.1.bn1.running_mean", "pretrained4.layer4.1.bn1.running_var", "pretrained4.layer4.1.conv2.weight", "pretrained4.layer4.1.bn2.weight", "pretrained4.layer4.1.bn2.bias", "pretrained4.layer4.1.bn2.running_mean", "pretrained4.layer4.1.bn2.running_var", "pretrained4.fc.weight", "pretrained4.fc.bias", "psp1.conv1.0.weight", "psp1.conv1.1.weight", "psp1.conv1.1.bias", "psp1.conv1.1.running_mean", "psp1.conv1.1.running_var", "psp1.conv2.0.weight", "psp1.conv2.1.weight", "psp1.conv2.1.bias", "psp1.conv2.1.running_mean", "psp1.conv2.1.running_var", "psp1.conv3.0.weight", "psp1.conv3.1.weight", "psp1.conv3.1.bias", "psp1.conv3.1.running_mean", "psp1.conv3.1.running_var", "psp1.conv4.0.weight", "psp1.conv4.1.weight", "psp1.conv4.1.bias", "psp1.conv4.1.running_mean", "psp1.conv4.1.running_var", "psp2.conv1.0.weight", "psp2.conv1.1.weight", "psp2.conv1.1.bias", "psp2.conv1.1.running_mean", "psp2.conv1.1.running_var", "psp2.conv2.0.weight", "psp2.conv2.1.weight", "psp2.conv2.1.bias", "psp2.conv2.1.running_mean", "psp2.conv2.1.running_var", "psp2.conv3.0.weight", "psp2.conv3.1.weight", "psp2.conv3.1.bias", "psp2.conv3.1.running_mean", "psp2.conv3.1.running_var", "psp2.conv4.0.weight", "psp2.conv4.1.weight", "psp2.conv4.1.bias", "psp2.conv4.1.running_mean", "psp2.conv4.1.running_var", "psp3.conv1.0.weight", "psp3.conv1.1.weight", "psp3.conv1.1.bias", "psp3.conv1.1.running_mean", "psp3.conv1.1.running_var", "psp3.conv2.0.weight", "psp3.conv2.1.weight", "psp3.conv2.1.bias", "psp3.conv2.1.running_mean", "psp3.conv2.1.running_var", "psp3.conv3.0.weight", "psp3.conv3.1.weight", "psp3.conv3.1.bias", "psp3.conv3.1.running_mean", "psp3.conv3.1.running_var", "psp3.conv4.0.weight", "psp3.conv4.1.weight", "psp3.conv4.1.bias", "psp3.conv4.1.running_mean", "psp3.conv4.1.running_var", "psp4.conv1.0.weight", "psp4.conv1.1.weight", "psp4.conv1.1.bias", "psp4.conv1.1.running_mean", "psp4.conv1.1.running_var", "psp4.conv2.0.weight", "psp4.conv2.1.weight", "psp4.conv2.1.bias", "psp4.conv2.1.running_mean", "psp4.conv2.1.running_var", "psp4.conv3.0.weight", "psp4.conv3.1.weight", "psp4.conv3.1.bias", "psp4.conv3.1.running_mean", "psp4.conv3.1.running_var", "psp4.conv4.0.weight", "psp4.conv4.1.weight", "psp4.conv4.1.bias", "psp4.conv4.1.running_mean", "psp4.conv4.1.running_var", "enc1.w_qs.0.conv.weight", "enc1.w_qs.0.conv.bias", "enc1.w_qs.0.bn.weight", "enc1.w_qs.0.bn.bias", "enc1.w_qs.0.bn.running_mean", "enc1.w_qs.0.bn.running_var", "enc1.w_qs.1.conv.weight", "enc1.w_qs.1.conv.bias", "enc1.w_ks.0.conv.weight", "enc1.w_ks.0.conv.bias", "enc1.w_ks.0.bn.weight", "enc1.w_ks.0.bn.bias", "enc1.w_ks.0.bn.running_mean", "enc1.w_ks.0.bn.running_var", "enc1.w_ks.1.conv.weight", "enc1.w_ks.1.conv.bias", "enc1.w_vs.0.conv.weight", "enc1.w_vs.0.conv.bias", "enc2.w_qs.0.conv.weight", "enc2.w_qs.0.conv.bias", "enc2.w_qs.0.bn.weight", "enc2.w_qs.0.bn.bias", "enc2.w_qs.0.bn.running_mean", "enc2.w_qs.0.bn.running_var", "enc2.w_qs.1.conv.weight", "enc2.w_qs.1.conv.bias", "enc2.w_ks.0.conv.weight", "enc2.w_ks.0.conv.bias", "enc2.w_ks.0.bn.weight", "enc2.w_ks.0.bn.bias", "enc2.w_ks.0.bn.running_mean", "enc2.w_ks.0.bn.running_var", "enc2.w_ks.1.conv.weight", "enc2.w_ks.1.conv.bias", "enc2.w_vs.0.conv.weight", "enc2.w_vs.0.conv.bias", "enc3.w_qs.0.conv.weight", "enc3.w_qs.0.conv.bias", "enc3.w_qs.0.bn.weight", "enc3.w_qs.0.bn.bias", "enc3.w_qs.0.bn.running_mean", "enc3.w_qs.0.bn.running_var", "enc3.w_qs.1.conv.weight", "enc3.w_qs.1.conv.bias", "enc3.w_ks.0.conv.weight", "enc3.w_ks.0.conv.bias", "enc3.w_ks.0.bn.weight", "enc3.w_ks.0.bn.bias", "enc3.w_ks.0.bn.running_mean", "enc3.w_ks.0.bn.running_var", "enc3.w_ks.1.conv.weight", "enc3.w_ks.1.conv.bias", "enc3.w_vs.0.conv.weight", "enc3.w_vs.0.conv.bias", "enc4.w_qs.0.conv.weight", "enc4.w_qs.0.conv.bias", "enc4.w_qs.0.bn.weight", "enc4.w_qs.0.bn.bias", "enc4.w_qs.0.bn.running_mean", "enc4.w_qs.0.bn.running_var", "enc4.w_qs.1.conv.weight", "enc4.w_qs.1.conv.bias", "enc4.w_ks.0.conv.weight", "enc4.w_ks.0.conv.bias", "enc4.w_ks.0.bn.weight", "enc4.w_ks.0.bn.bias", "enc4.w_ks.0.bn.running_mean", "enc4.w_ks.0.bn.running_var", "enc4.w_ks.1.conv.weight", "enc4.w_ks.1.conv.bias", "enc4.w_vs.0.conv.weight", "enc4.w_vs.0.conv.bias", "atn1_2.fc.0.conv.weight", "atn1_2.fc.0.conv.bias", "atn1_3.fc.0.conv.weight", "atn1_3.fc.0.conv.bias", "atn1_4.fc.0.conv.weight", "atn1_4.fc.0.conv.bias", "atn2_1.fc.0.conv.weight", "atn2_1.fc.0.conv.bias", "atn2_3.fc.0.conv.weight", "atn2_3.fc.0.conv.bias", "atn2_4.fc.0.conv.weight", "atn2_4.fc.0.conv.bias", "atn3_1.fc.0.conv.weight", "atn3_1.fc.0.conv.bias", "atn3_2.fc.0.conv.weight", "atn3_2.fc.0.conv.bias", "atn3_4.fc.0.conv.weight", "atn3_4.fc.0.conv.bias", "atn4_1.fc.0.conv.weight", "atn4_1.fc.0.conv.bias", "atn4_2.fc.0.conv.weight", "atn4_2.fc.0.conv.bias", "atn4_3.fc.0.conv.weight", "atn4_3.fc.0.conv.bias", "layer_norm1.ln.weight", "layer_norm1.ln.bias", "layer_norm2.ln.weight", "layer_norm2.ln.bias", "layer_norm3.ln.weight", "layer_norm3.ln.bias", "layer_norm4.ln.weight", "layer_norm4.ln.bias", "head1.conv5.0.weight", "head1.conv5.1.weight", "head1.conv5.1.bias", "head1.conv5.1.running_mean", "head1.conv5.1.running_var", "head1.conv5.4.weight", "head1.conv5.4.bias", "head2.conv5.0.weight", "head2.conv5.1.weight", "head2.conv5.1.bias", "head2.conv5.1.running_mean", "head2.conv5.1.running_var", "head2.conv5.4.weight", "head2.conv5.4.bias", "head3.conv5.0.weight", "head3.conv5.1.weight", "head3.conv5.1.bias", "head3.conv5.1.running_mean", "head3.conv5.1.running_var", "head3.conv5.4.weight", "head3.conv5.4.bias", "head4.conv5.0.weight", "head4.conv5.1.weight", "head4.conv5.1.bias", "head4.conv5.1.running_mean", "head4.conv5.1.running_var", "head4.conv5.4.weight", "head4.conv5.4.bias", "auxlayer1.conv5.0.weight", "auxlayer1.conv5.1.weight", "auxlayer1.conv5.1.bias", "auxlayer1.conv5.1.running_mean", "auxlayer1.conv5.1.running_var", "auxlayer1.conv5.4.weight", "auxlayer1.conv5.4.bias", "auxlayer2.conv5.0.weight", "auxlayer2.conv5.1.weight", "auxlayer2.conv5.1.bias", "auxlayer2.conv5.1.running_mean", "auxlayer2.conv5.1.running_var", "auxlayer2.conv5.4.weight", "auxlayer2.conv5.4.bias", "auxlayer3.conv5.0.weight", "auxlayer3.conv5.1.weight", "auxlayer3.conv5.1.bias", "auxlayer3.conv5.1.running_mean", "auxlayer3.conv5.1.running_var", "auxlayer3.conv5.4.weight", "auxlayer3.conv5.4.bias", "auxlayer4.conv5.0.weight", "auxlayer4.conv5.1.weight", "auxlayer4.conv5.1.bias", "auxlayer4.conv5.1.running_mean", "auxlayer4.conv5.1.running_var", "auxlayer4.conv5.4.weight", "auxlayer4.conv5.4.bias", "teacher.pretrained.conv1.0.weight", "teacher.pretrained.conv1.1.weight", "teacher.pretrained.conv1.1.bias", "teacher.pretrained.conv1.1.running_mean", "teacher.pretrained.conv1.1.running_var", "teacher.pretrained.conv1.3.weight", "teacher.pretrained.conv1.4.weight", "teacher.pretrained.conv1.4.bias", "teacher.pretrained.conv1.4.running_mean", "teacher.pretrained.conv1.4.running_var", "teacher.pretrained.conv1.6.weight", "teacher.pretrained.bn1.weight", "teacher.pretrained.bn1.bias", "teacher.pretrained.bn1.running_mean", "teacher.pretrained.bn1.running_var", "teacher.pretrained.layer1.0.conv1.weight", "teacher.pretrained.layer1.0.bn1.weight", "teacher.pretrained.layer1.0.bn1.bias", "teacher.pretrained.layer1.0.bn1.running_mean", "teacher.pretrained.layer1.0.bn1.running_var", "teacher.pretrained.layer1.0.conv2.weight", "teacher.pretrained.layer1.0.bn2.weight", "teacher.pretrained.layer1.0.bn2.bias", "teacher.pretrained.layer1.0.bn2.running_mean", "teacher.pretrained.layer1.0.bn2.running_var", "teacher.pretrained.layer1.0.conv3.weight", "teacher.pretrained.layer1.0.bn3.weight", "teacher.pretrained.layer1.0.bn3.bias", "teacher.pretrained.layer1.0.bn3.running_mean", "teacher.pretrained.layer1.0.bn3.running_var", "teacher.pretrained.layer1.0.downsample.0.weight", "teacher.pretrained.layer1.0.downsample.1.weight", "teacher.pretrained.layer1.0.downsample.1.bias", "teacher.pretrained.layer1.0.downsample.1.running_mean", "teacher.pretrained.layer1.0.downsample.1.running_var", "teacher.pretrained.layer1.1.conv1.weight", "teacher.pretrained.layer1.1.bn1.weight", "teacher.pretrained.layer1.1.bn1.bias", "teacher.pretrained.layer1.1.bn1.running_mean", "teacher.pretrained.layer1.1.bn1.running_var", "teacher.pretrained.layer1.1.conv2.weight", "teacher.pretrained.layer1.1.bn2.weight", "teacher.pretrained.layer1.1.bn2.bias", "teacher.pretrained.layer1.1.bn2.running_mean", "teacher.pretrained.layer1.1.bn2.running_var", "teacher.pretrained.layer1.1.conv3.weight", "teacher.pretrained.layer1.1.bn3.weight", "teacher.pretrained.layer1.1.bn3.bias", "teacher.pretrained.layer1.1.bn3.running_mean", "teacher.pretrained.layer1.1.bn3.running_var", "teacher.pretrained.layer1.2.conv1.weight", "teacher.pretrained.layer1.2.bn1.weight", "teacher.pretrained.layer1.2.bn1.bias", "teacher.pretrained.layer1.2.bn1.running_mean", "teacher.pretrained.layer1.2.bn1.running_var", "teacher.pretrained.layer1.2.conv2.weight", "teacher.pretrained.layer1.2.bn2.weight", "teacher.pretrained.layer1.2.bn2.bias", "teacher.pretrained.layer1.2.bn2.running_mean", "teacher.pretrained.layer1.2.bn2.running_var", "teacher.pretrained.layer1.2.conv3.weight", "teacher.pretrained.layer1.2.bn3.weight", "teacher.pretrained.layer1.2.bn3.bias", "teacher.pretrained.layer1.2.bn3.running_mean", "teacher.pretrained.layer1.2.bn3.running_var", "teacher.pretrained.layer2.0.conv1.weight", "teacher.pretrained.layer2.0.bn1.weight", "teacher.pretrained.layer2.0.bn1.bias", "teacher.pretrained.layer2.0.bn1.running_mean", "teacher.pretrained.layer2.0.bn1.running_var", "teacher.pretrained.layer2.0.conv2.weight", "teacher.pretrained.layer2.0.bn2.weight", "teacher.pretrained.layer2.0.bn2.bias", "teacher.pretrained.layer2.0.bn2.running_mean", "teacher.pretrained.layer2.0.bn2.running_var", "teacher.pretrained.layer2.0.conv3.weight", "teacher.pretrained.layer2.0.bn3.weight", "teacher.pretrained.layer2.0.bn3.bias", "teacher.pretrained.layer2.0.bn3.running_mean", "teacher.pretrained.layer2.0.bn3.running_var", "teacher.pretrained.layer2.0.downsample.0.weight", "teacher.pretrained.layer2.0.downsample.1.weight", "teacher.pretrained.layer2.0.downsample.1.bias", "teacher.pretrained.layer2.0.downsample.1.running_mean", "teacher.pretrained.layer2.0.downsample.1.running_var", "teacher.pretrained.layer2.1.conv1.weight", "teacher.pretrained.layer2.1.bn1.weight", "teacher.pretrained.layer2.1.bn1.bias", "teacher.pretrained.layer2.1.bn1.running_mean", "teacher.pretrained.layer2.1.bn1.running_var", "teacher.pretrained.layer2.1.conv2.weight", "teacher.pretrained.layer2.1.bn2.weight", "teacher.pretrained.layer2.1.bn2.bias", "teacher.pretrained.layer2.1.bn2.running_mean", "teacher.pretrained.layer2.1.bn2.running_var", "teacher.pretrained.layer2.1.conv3.weight", "teacher.pretrained.layer2.1.bn3.weight", "teacher.pretrained.layer2.1.bn3.bias", "teacher.pretrained.layer2.1.bn3.running_mean", "teacher.pretrained.layer2.1.bn3.running_var", "teacher.pretrained.layer2.2.conv1.weight", "teacher.pretrained.layer2.2.bn1.weight", "teacher.pretrained.layer2.2.bn1.bias", "teacher.pretrained.layer2.2.bn1.running_mean", "teacher.pretrained.layer2.2.bn1.running_var", "teacher.pretrained.layer2.2.conv2.weight", "teacher.pretrained.layer2.2.bn2.weight", "teacher.pretrained.layer2.2.bn2.bias", "teacher.pretrained.layer2.2.bn2.running_mean", "teacher.pretrained.layer2.2.bn2.running_var", "teacher.pretrained.layer2.2.conv3.weight", "teacher.pretrained.layer2.2.bn3.weight", "teacher.pretrained.layer2.2.bn3.bias", "teacher.pretrained.layer2.2.bn3.running_mean", "teacher.pretrained.layer2.2.bn3.running_var", "teacher.pretrained.layer2.3.conv1.weight", "teacher.pretrained.layer2.3.bn1.weight", "teacher.pretrained.layer2.3.bn1.bias", "teacher.pretrained.layer2.3.bn1.running_mean", "teacher.pretrained.layer2.3.bn1.running_var", "teacher.pretrained.layer2.3.conv2.weight", "teacher.pretrained.layer2.3.bn2.weight", "teacher.pretrained.layer2.3.bn2.bias", "teacher.pretrained.layer2.3.bn2.running_mean", "teacher.pretrained.layer2.3.bn2.running_var", "teacher.pretrained.layer2.3.conv3.weight", "teacher.pretrained.layer2.3.bn3.weight", "teacher.pretrained.layer2.3.bn3.bias", "teacher.pretrained.layer2.3.bn3.running_mean", "teacher.pretrained.layer2.3.bn3.running_var", "teacher.pretrained.layer3.0.conv1.weight", "teacher.pretrained.layer3.0.bn1.weight", "teacher.pretrained.layer3.0.bn1.bias", "teacher.pretrained.layer3.0.bn1.running_mean", "teacher.pretrained.layer3.0.bn1.running_var", "teacher.pretrained.layer3.0.conv2.weight", "teacher.pretrained.layer3.0.bn2.weight", "teacher.pretrained.layer3.0.bn2.bias", "teacher.pretrained.layer3.0.bn2.running_mean", "teacher.pretrained.layer3.0.bn2.running_var", "teacher.pretrained.layer3.0.conv3.weight", "teacher.pretrained.layer3.0.bn3.weight", "teacher.pretrained.layer3.0.bn3.bias", "teacher.pretrained.layer3.0.bn3.running_mean", "teacher.pretrained.layer3.0.bn3.running_var", "teacher.pretrained.layer3.0.downsample.0.weight", "teacher.pretrained.layer3.0.downsample.1.weight", "teacher.pretrained.layer3.0.downsample.1.bias", "teacher.pretrained.layer3.0.downsample.1.running_mean", "teacher.pretrained.layer3.0.downsample.1.running_var", "teacher.pretrained.layer3.1.conv1.weight", "teacher.pretrained.layer3.1.bn1.weight", "teacher.pretrained.layer3.1.bn1.bias", "teacher.pretrained.layer3.1.bn1.running_mean", "teacher.pretrained.layer3.1.bn1.running_var", "teacher.pretrained.layer3.1.conv2.weight", "teacher.pretrained.layer3.1.bn2.weight", "teacher.pretrained.layer3.1.bn2.bias", "teacher.pretrained.layer3.1.bn2.running_mean", "teacher.pretrained.layer3.1.bn2.running_var", "teacher.pretrained.layer3.1.conv3.weight", "teacher.pretrained.layer3.1.bn3.weight", "teacher.pretrained.layer3.1.bn3.bias", "teacher.pretrained.layer3.1.bn3.running_mean", "teacher.pretrained.layer3.1.bn3.running_var", "teacher.pretrained.layer3.2.conv1.weight", "teacher.pretrained.layer3.2.bn1.weight", "teacher.pretrained.layer3.2.bn1.bias", "teacher.pretrained.layer3.2.bn1.running_mean", "teacher.pretrained.layer3.2.bn1.running_var", "teacher.pretrained.layer3.2.conv2.weight", "teacher.pretrained.layer3.2.bn2.weight", "teacher.pretrained.layer3.2.bn2.bias", "teacher.pretrained.layer3.2.bn2.running_mean", "teacher.pretrained.layer3.2.bn2.running_var", "teacher.pretrained.layer3.2.conv3.weight", "teacher.pretrained.layer3.2.bn3.weight", "teacher.pretrained.layer3.2.bn3.bias", "teacher.pretrained.layer3.2.bn3.running_mean", "teacher.pretrained.layer3.2.bn3.running_var", "teacher.pretrained.layer3.3.conv1.weight", "teacher.pretrained.layer3.3.bn1.weight", "teacher.pretrained.layer3.3.bn1.bias", "teacher.pretrained.layer3.3.bn1.running_mean", "teacher.pretrained.layer3.3.bn1.running_var", "teacher.pretrained.layer3.3.conv2.weight", "teacher.pretrained.layer3.3.bn2.weight", "teacher.pretrained.layer3.3.bn2.bias", "teacher.pretrained.layer3.3.bn2.running_mean", "teacher.pretrained.layer3.3.bn2.running_var", "teacher.pretrained.layer3.3.conv3.weight", "teacher.pretrained.layer3.3.bn3.weight", "teacher.pretrained.layer3.3.bn3.bias", "teacher.pretrained.layer3.3.bn3.running_mean", "teacher.pretrained.layer3.3.bn3.running_var", "teacher.pretrained.layer3.4.conv1.weight", "teacher.pretrained.layer3.4.bn1.weight", "teacher.pretrained.layer3.4.bn1.bias", "teacher.pretrained.layer3.4.bn1.running_mean", "teacher.pretrained.layer3.4.bn1.running_var", "teacher.pretrained.layer3.4.conv2.weight", "teacher.pretrained.layer3.4.bn2.weight", "teacher.pretrained.layer3.4.bn2.bias", "teacher.pretrained.layer3.4.bn2.running_mean", "teacher.pretrained.layer3.4.bn2.running_var", "teacher.pretrained.layer3.4.conv3.weight", "teacher.pretrained.layer3.4.bn3.weight", "teacher.pretrained.layer3.4.bn3.bias", "teacher.pretrained.layer3.4.bn3.running_mean", "teacher.pretrained.layer3.4.bn3.running_var", "teacher.pretrained.layer3.5.conv1.weight", "teacher.pretrained.layer3.5.bn1.weight", "teacher.pretrained.layer3.5.bn1.bias", "teacher.pretrained.layer3.5.bn1.running_mean", "teacher.pretrained.layer3.5.bn1.running_var", "teacher.pretrained.layer3.5.conv2.weight", "teacher.pretrained.layer3.5.bn2.weight", "teacher.pretrained.layer3.5.bn2.bias", "teacher.pretrained.layer3.5.bn2.running_mean", "teacher.pretrained.layer3.5.bn2.running_var", "teacher.pretrained.layer3.5.conv3.weight", "teacher.pretrained.layer3.5.bn3.weight", "teacher.pretrained.layer3.5.bn3.bias", "teacher.pretrained.layer3.5.bn3.running_mean", "teacher.pretrained.layer3.5.bn3.running_var", "teacher.pretrained.layer3.6.conv1.weight", "teacher.pretrained.layer3.6.bn1.weight", "teacher.pretrained.layer3.6.bn1.bias", "teacher.pretrained.layer3.6.bn1.running_mean", "teacher.pretrained.layer3.6.bn1.running_var", "teacher.pretrained.layer3.6.conv2.weight", "teacher.pretrained.layer3.6.bn2.weight", "teacher.pretrained.layer3.6.bn2.bias", "teacher.pretrained.layer3.6.bn2.running_mean", "teacher.pretrained.layer3.6.bn2.running_var", "teacher.pretrained.layer3.6.conv3.weight", "teacher.pretrained.layer3.6.bn3.weight", "teacher.pretrained.layer3.6.bn3.bias", "teacher.pretrained.layer3.6.bn3.running_mean", "teacher.pretrained.layer3.6.bn3.running_var", "teacher.pretrained.layer3.7.conv1.weight", "teacher.pretrained.layer3.7.bn1.weight", "teacher.pretrained.layer3.7.bn1.bias", "teacher.pretrained.layer3.7.bn1.running_mean", "teacher.pretrained.layer3.7.bn1.running_var", "teacher.pretrained.layer3.7.conv2.weight", "teacher.pretrained.layer3.7.bn2.weight", "teacher.pretrained.layer3.7.bn2.bias", "teacher.pretrained.layer3.7.bn2.running_mean", "teacher.pretrained.layer3.7.bn2.running_var", "teacher.pretrained.layer3.7.conv3.weight", "teacher.pretrained.layer3.7.bn3.weight", "teacher.pretrained.layer3.7.bn3.bias", "teacher.pretrained.layer3.7.bn3.running_mean", "teacher.pretrained.layer3.7.bn3.running_var", "teacher.pretrained.layer3.8.conv1.weight", "teacher.pretrained.layer3.8.bn1.weight", "teacher.pretrained.layer3.8.bn1.bias", "teacher.pretrained.layer3.8.bn1.running_mean", "teacher.pretrained.layer3.8.bn1.running_var", "teacher.pretrained.layer3.8.conv2.weight", "teacher.pretrained.layer3.8.bn2.weight", "teacher.pretrained.layer3.8.bn2.bias", "teacher.pretrained.layer3.8.bn2.running_mean", "teacher.pretrained.layer3.8.bn2.running_var", "teacher.pretrained.layer3.8.conv3.weight", "teacher.pretrained.layer3.8.bn3.weight", "teacher.pretrained.layer3.8.bn3.bias", "teacher.pretrained.layer3.8.bn3.running_mean", "teacher.pretrained.layer3.8.bn3.running_var", "teacher.pretrained.layer3.9.conv1.weight", "teacher.pretrained.layer3.9.bn1.weight", "teacher.pretrained.layer3.9.bn1.bias", "teacher.pretrained.layer3.9.bn1.running_mean", "teacher.pretrained.layer3.9.bn1.running_var", "teacher.pretrained.layer3.9.conv2.weight", "teacher.pretrained.layer3.9.bn2.weight", "teacher.pretrained.layer3.9.bn2.bias", "teacher.pretrained.layer3.9.bn2.running_mean", "teacher.pretrained.layer3.9.bn2.running_var", "teacher.pretrained.layer3.9.conv3.weight", "teacher.pretrained.layer3.9.bn3.weight", "teacher.pretrained.layer3.9.bn3.bias", "teacher.pretrained.layer3.9.bn3.running_mean", "teacher.pretrained.layer3.9.bn3.running_var", "teacher.pretrained.layer3.10.conv1.weight", "teacher.pretrained.layer3.10.bn1.weight", "teacher.pretrained.layer3.10.bn1.bias", "teacher.pretrained.layer3.10.bn1.running_mean", "teacher.pretrained.layer3.10.bn1.running_var", "teacher.pretrained.layer3.10.conv2.weight", "teacher.pretrained.layer3.10.bn2.weight", "teacher.pretrained.layer3.10.bn2.bias", "teacher.pretrained.layer3.10.bn2.running_mean", "teacher.pretrained.layer3.10.bn2.running_var", "teacher.pretrained.layer3.10.conv3.weight", "teacher.pretrained.layer3.10.bn3.weight", "teacher.pretrained.layer3.10.bn3.bias", "teacher.pretrained.layer3.10.bn3.running_mean", "teacher.pretrained.layer3.10.bn3.running_var", "teacher.pretrained.layer3.11.conv1.weight", "teacher.pretrained.layer3.11.bn1.weight", "teacher.pretrained.layer3.11.bn1.bias", "teacher.pretrained.layer3.11.bn1.running_mean", "teacher.pretrained.layer3.11.bn1.running_var", "teacher.pretrained.layer3.11.conv2.weight", "teacher.pretrained.layer3.11.bn2.weight", "teacher.pretrained.layer3.11.bn2.bias", "teacher.pretrained.layer3.11.bn2.running_mean", "teacher.pretrained.layer3.11.bn2.running_var", "teacher.pretrained.layer3.11.conv3.weight", "teacher.pretrained.layer3.11.bn3.weight", "teacher.pretrained.layer3.11.bn3.bias", "teacher.pretrained.layer3.11.bn3.running_mean", "teacher.pretrained.layer3.11.bn3.running_var", "teacher.pretrained.layer3.12.conv1.weight", "teacher.pretrained.layer3.12.bn1.weight", "teacher.pretrained.layer3.12.bn1.bias", "teacher.pretrained.layer3.12.bn1.running_mean", "teacher.pretrained.layer3.12.bn1.running_var", "teacher.pretrained.layer3.12.conv2.weight", "teacher.pretrained.layer3.12.bn2.weight", "teacher.pretrained.layer3.12.bn2.bias", "teacher.pretrained.layer3.12.bn2.running_mean", "teacher.pretrained.layer3.12.bn2.running_var", "teacher.pretrained.layer3.12.conv3.weight", "teacher.pretrained.layer3.12.bn3.weight", "teacher.pretrained.layer3.12.bn3.bias", "teacher.pretrained.layer3.12.bn3.running_mean", "teacher.pretrained.layer3.12.bn3.running_var", "teacher.pretrained.layer3.13.conv1.weight", "teacher.pretrained.layer3.13.bn1.weight", "teacher.pretrained.layer3.13.bn1.bias", "teacher.pretrained.layer3.13.bn1.running_mean", "teacher.pretrained.layer3.13.bn1.running_var", "teacher.pretrained.layer3.13.conv2.weight", "teacher.pretrained.layer3.13.bn2.weight", "teacher.pretrained.layer3.13.bn2.bias", "teacher.pretrained.layer3.13.bn2.running_mean", "teacher.pretrained.layer3.13.bn2.running_var", "teacher.pretrained.layer3.13.conv3.weight", "teacher.pretrained.layer3.13.bn3.weight", "teacher.pretrained.layer3.13.bn3.bias", "teacher.pretrained.layer3.13.bn3.running_mean", "teacher.pretrained.layer3.13.bn3.running_var", "teacher.pretrained.layer3.14.conv1.weight", "teacher.pretrained.layer3.14.bn1.weight", "teacher.pretrained.layer3.14.bn1.bias", "teacher.pretrained.layer3.14.bn1.running_mean", "teacher.pretrained.layer3.14.bn1.running_var", "teacher.pretrained.layer3.14.conv2.weight", "teacher.pretrained.layer3.14.bn2.weight", "teacher.pretrained.layer3.14.bn2.bias", "teacher.pretrained.layer3.14.bn2.running_mean", "teacher.pretrained.layer3.14.bn2.running_var", "teacher.pretrained.layer3.14.conv3.weight", "teacher.pretrained.layer3.14.bn3.weight", "teacher.pretrained.layer3.14.bn3.bias", "teacher.pretrained.layer3.14.bn3.running_mean", "teacher.pretrained.layer3.14.bn3.running_var", "teacher.pretrained.layer3.15.conv1.weight", "teacher.pretrained.layer3.15.bn1.weight", "teacher.pretrained.layer3.15.bn1.bias", "teacher.pretrained.layer3.15.bn1.running_mean", "teacher.pretrained.layer3.15.bn1.running_var", "teacher.pretrained.layer3.15.conv2.weight", "teacher.pretrained.layer3.15.bn2.weight", "teacher.pretrained.layer3.15.bn2.bias", "teacher.pretrained.layer3.15.bn2.running_mean", "teacher.pretrained.layer3.15.bn2.running_var", "teacher.pretrained.layer3.15.conv3.weight", "teacher.pretrained.layer3.15.bn3.weight", "teacher.pretrained.layer3.15.bn3.bias", "teacher.pretrained.layer3.15.bn3.running_mean", "teacher.pretrained.layer3.15.bn3.running_var", "teacher.pretrained.layer3.16.conv1.weight", "teacher.pretrained.layer3.16.bn1.weight", "teacher.pretrained.layer3.16.bn1.bias", "teacher.pretrained.layer3.16.bn1.running_mean", "teacher.pretrained.layer3.16.bn1.running_var", "teacher.pretrained.layer3.16.conv2.weight", "teacher.pretrained.layer3.16.bn2.weight", "teacher.pretrained.layer3.16.bn2.bias", "teacher.pretrained.layer3.16.bn2.running_mean", "teacher.pretrained.layer3.16.bn2.running_var", "teacher.pretrained.layer3.16.conv3.weight", "teacher.pretrained.layer3.16.bn3.weight", "teacher.pretrained.layer3.16.bn3.bias", "teacher.pretrained.layer3.16.bn3.running_mean", "teacher.pretrained.layer3.16.bn3.running_var", "teacher.pretrained.layer3.17.conv1.weight", "teacher.pretrained.layer3.17.bn1.weight", "teacher.pretrained.layer3.17.bn1.bias", "teacher.pretrained.layer3.17.bn1.running_mean", "teacher.pretrained.layer3.17.bn1.running_var", "teacher.pretrained.layer3.17.conv2.weight", "teacher.pretrained.layer3.17.bn2.weight", "teacher.pretrained.layer3.17.bn2.bias", "teacher.pretrained.layer3.17.bn2.running_mean", "teacher.pretrained.layer3.17.bn2.running_var", "teacher.pretrained.layer3.17.conv3.weight", "teacher.pretrained.layer3.17.bn3.weight", "teacher.pretrained.layer3.17.bn3.bias", "teacher.pretrained.layer3.17.bn3.running_mean", "teacher.pretrained.layer3.17.bn3.running_var", "teacher.pretrained.layer3.18.conv1.weight", "teacher.pretrained.layer3.18.bn1.weight", "teacher.pretrained.layer3.18.bn1.bias", "teacher.pretrained.layer3.18.bn1.running_mean", "teacher.pretrained.layer3.18.bn1.running_var", "teacher.pretrained.layer3.18.conv2.weight", "teacher.pretrained.layer3.18.bn2.weight", "teacher.pretrained.layer3.18.bn2.bias", "teacher.pretrained.layer3.18.bn2.running_mean", "teacher.pretrained.layer3.18.bn2.running_var", "teacher.pretrained.layer3.18.conv3.weight", "teacher.pretrained.layer3.18.bn3.weight", "teacher.pretrained.layer3.18.bn3.bias", "teacher.pretrained.layer3.18.bn3.running_mean", "teacher.pretrained.layer3.18.bn3.running_var", "teacher.pretrained.layer3.19.conv1.weight", "teacher.pretrained.layer3.19.bn1.weight", "teacher.pretrained.layer3.19.bn1.bias", "teacher.pretrained.layer3.19.bn1.running_mean", "teacher.pretrained.layer3.19.bn1.running_var", "teacher.pretrained.layer3.19.conv2.weight", "teacher.pretrained.layer3.19.bn2.weight", "teacher.pretrained.layer3.19.bn2.bias", "teacher.pretrained.layer3.19.bn2.running_mean", "teacher.pretrained.layer3.19.bn2.running_var", "teacher.pretrained.layer3.19.conv3.weight", "teacher.pretrained.layer3.19.bn3.weight", "teacher.pretrained.layer3.19.bn3.bias", "teacher.pretrained.layer3.19.bn3.running_mean", "teacher.pretrained.layer3.19.bn3.running_var", "teacher.pretrained.layer3.20.conv1.weight", "teacher.pretrained.layer3.20.bn1.weight", "teacher.pretrained.layer3.20.bn1.bias", "teacher.pretrained.layer3.20.bn1.running_mean", "teacher.pretrained.layer3.20.bn1.running_var", "teacher.pretrained.layer3.20.conv2.weight", "teacher.pretrained.layer3.20.bn2.weight", "teacher.pretrained.layer3.20.bn2.bias", "teacher.pretrained.layer3.20.bn2.running_mean", "teacher.pretrained.layer3.20.bn2.running_var", "teacher.pretrained.layer3.20.conv3.weight", "teacher.pretrained.layer3.20.bn3.weight", "teacher.pretrained.layer3.20.bn3.bias", "teacher.pretrained.layer3.20.bn3.running_mean", "teacher.pretrained.layer3.20.bn3.running_var", "teacher.pretrained.layer3.21.conv1.weight", "teacher.pretrained.layer3.21.bn1.weight", "teacher.pretrained.layer3.21.bn1.bias", "teacher.pretrained.layer3.21.bn1.running_mean", "teacher.pretrained.layer3.21.bn1.running_var", "teacher.pretrained.layer3.21.conv2.weight", "teacher.pretrained.layer3.21.bn2.weight", "teacher.pretrained.layer3.21.bn2.bias", "teacher.pretrained.layer3.21.bn2.running_mean", "teacher.pretrained.layer3.21.bn2.running_var", "teacher.pretrained.layer3.21.conv3.weight", "teacher.pretrained.layer3.21.bn3.weight", "teacher.pretrained.layer3.21.bn3.bias", "teacher.pretrained.layer3.21.bn3.running_mean", "teacher.pretrained.layer3.21.bn3.running_var", "teacher.pretrained.layer3.22.conv1.weight", "teacher.pretrained.layer3.22.bn1.weight", "teacher.pretrained.layer3.22.bn1.bias", "teacher.pretrained.layer3.22.bn1.running_mean", "teacher.pretrained.layer3.22.bn1.running_var", "teacher.pretrained.layer3.22.conv2.weight", "teacher.pretrained.layer3.22.bn2.weight", "teacher.pretrained.layer3.22.bn2.bias", "teacher.pretrained.layer3.22.bn2.running_mean", "teacher.pretrained.layer3.22.bn2.running_var", "teacher.pretrained.layer3.22.conv3.weight", "teacher.pretrained.layer3.22.bn3.weight", "teacher.pretrained.layer3.22.bn3.bias", "teacher.pretrained.layer3.22.bn3.running_mean", "teacher.pretrained.layer3.22.bn3.running_var", "teacher.pretrained.layer4.0.conv1.weight", "teacher.pretrained.layer4.0.bn1.weight", "teacher.pretrained.layer4.0.bn1.bias", "teacher.pretrained.layer4.0.bn1.running_mean", "teacher.pretrained.layer4.0.bn1.running_var", "teacher.pretrained.layer4.0.conv2.weight", "teacher.pretrained.layer4.0.bn2.weight", "teacher.pretrained.layer4.0.bn2.bias", "teacher.pretrained.layer4.0.bn2.running_mean", "teacher.pretrained.layer4.0.bn2.running_var", "teacher.pretrained.layer4.0.conv3.weight", "teacher.pretrained.layer4.0.bn3.weight", "teacher.pretrained.layer4.0.bn3.bias", "teacher.pretrained.layer4.0.bn3.running_mean", "teacher.pretrained.layer4.0.bn3.running_var", "teacher.pretrained.layer4.0.downsample.0.weight", "teacher.pretrained.layer4.0.downsample.1.weight", "teacher.pretrained.layer4.0.downsample.1.bias", "teacher.pretrained.layer4.0.downsample.1.running_mean", "teacher.pretrained.layer4.0.downsample.1.running_var", "teacher.pretrained.layer4.1.conv1.weight", "teacher.pretrained.layer4.1.bn1.weight", "teacher.pretrained.layer4.1.bn1.bias", "teacher.pretrained.layer4.1.bn1.running_mean", "teacher.pretrained.layer4.1.bn1.running_var", "teacher.pretrained.layer4.1.conv2.weight", "teacher.pretrained.layer4.1.bn2.weight", "teacher.pretrained.layer4.1.bn2.bias", "teacher.pretrained.layer4.1.bn2.running_mean", "teacher.pretrained.layer4.1.bn2.running_var", "teacher.pretrained.layer4.1.conv3.weight", "teacher.pretrained.layer4.1.bn3.weight", "teacher.pretrained.layer4.1.bn3.bias", "teacher.pretrained.layer4.1.bn3.running_mean", "teacher.pretrained.layer4.1.bn3.running_var", "teacher.pretrained.layer4.2.conv1.weight", "teacher.pretrained.layer4.2.bn1.weight", "teacher.pretrained.layer4.2.bn1.bias", "teacher.pretrained.layer4.2.bn1.running_mean", "teacher.pretrained.layer4.2.bn1.running_var", "teacher.pretrained.layer4.2.conv2.weight", "teacher.pretrained.layer4.2.bn2.weight", "teacher.pretrained.layer4.2.bn2.bias", "teacher.pretrained.layer4.2.bn2.running_mean", "teacher.pretrained.layer4.2.bn2.running_var", "teacher.pretrained.layer4.2.conv3.weight", "teacher.pretrained.layer4.2.bn3.weight", "teacher.pretrained.layer4.2.bn3.bias", "teacher.pretrained.layer4.2.bn3.running_mean", "teacher.pretrained.layer4.2.bn3.running_var", "teacher.pretrained.fc.weight", "teacher.pretrained.fc.bias", "teacher.psp1.conv1.0.weight", "teacher.psp1.conv1.1.weight", "teacher.psp1.conv1.1.bias", "teacher.psp1.conv1.1.running_mean", "teacher.psp1.conv1.1.running_var", "teacher.psp1.conv2.0.weight", "teacher.psp1.conv2.1.weight", "teacher.psp1.conv2.1.bias", "teacher.psp1.conv2.1.running_mean", "teacher.psp1.conv2.1.running_var", "teacher.psp1.conv3.0.weight", "teacher.psp1.conv3.1.weight", "teacher.psp1.conv3.1.bias", "teacher.psp1.conv3.1.running_mean", "teacher.psp1.conv3.1.running_var", "teacher.psp1.conv4.0.weight", "teacher.psp1.conv4.1.weight", "teacher.psp1.conv4.1.bias", "teacher.psp1.conv4.1.running_mean", "teacher.psp1.conv4.1.running_var", "teacher.psp2.conv1.0.weight", "teacher.psp2.conv1.1.weight", "teacher.psp2.conv1.1.bias", "teacher.psp2.conv1.1.running_mean", "teacher.psp2.conv1.1.running_var", "teacher.psp2.conv2.0.weight", "teacher.psp2.conv2.1.weight", "teacher.psp2.conv2.1.bias", "teacher.psp2.conv2.1.running_mean", "teacher.psp2.conv2.1.running_var", "teacher.psp2.conv3.0.weight", "teacher.psp2.conv3.1.weight", "teacher.psp2.conv3.1.bias", "teacher.psp2.conv3.1.running_mean", "teacher.psp2.conv3.1.running_var", "teacher.psp2.conv4.0.weight", "teacher.psp2.conv4.1.weight", "teacher.psp2.conv4.1.bias", "teacher.psp2.conv4.1.running_mean", "teacher.psp2.conv4.1.running_var", "teacher.psp3.conv1.0.weight", "teacher.psp3.conv1.1.weight", "teacher.psp3.conv1.1.bias", "teacher.psp3.conv1.1.running_mean", "teacher.psp3.conv1.1.running_var", "teacher.psp3.conv2.0.weight", "teacher.psp3.conv2.1.weight", "teacher.psp3.conv2.1.bias", "teacher.psp3.conv2.1.running_mean", "teacher.psp3.conv2.1.running_var", "teacher.psp3.conv3.0.weight", "teacher.psp3.conv3.1.weight", "teacher.psp3.conv3.1.bias", "teacher.psp3.conv3.1.running_mean", "teacher.psp3.conv3.1.running_var", "teacher.psp3.conv4.0.weight", "teacher.psp3.conv4.1.weight", "teacher.psp3.conv4.1.bias", "teacher.psp3.conv4.1.running_mean", "teacher.psp3.conv4.1.running_var", "teacher.psp4.conv1.0.weight", "teacher.psp4.conv1.1.weight", "teacher.psp4.conv1.1.bias", "teacher.psp4.conv1.1.running_mean", "teacher.psp4.conv1.1.running_var", "teacher.psp4.conv2.0.weight", "teacher.psp4.conv2.1.weight", "teacher.psp4.conv2.1.bias", "teacher.psp4.conv2.1.running_mean", "teacher.psp4.conv2.1.running_var", "teacher.psp4.conv3.0.weight", "teacher.psp4.conv3.1.weight", "teacher.psp4.conv3.1.bias", "teacher.psp4.conv3.1.running_mean", "teacher.psp4.conv3.1.running_var", "teacher.psp4.conv4.0.weight", "teacher.psp4.conv4.1.weight", "teacher.psp4.conv4.1.bias", "teacher.psp4.conv4.1.running_mean", "teacher.psp4.conv4.1.running_var", "teacher.group1.conv5.0.weight", "teacher.group2.conv5.0.weight", "teacher.group3.conv5.0.weight", "teacher.group4.conv5.0.weight", "teacher.head.conv5.0.weight", "teacher.head.conv5.0.bias", "teacher.head.conv5.0.running_mean", "teacher.head.conv5.0.running_var", "teacher.head.conv5.3.weight", "teacher.head.conv5.3.bias". Unexpected key(s) in state_dict: "epoch", "model_state", "best_iou".

Please correct my mistake, thank you. (Among them, 51 lines of 'psp_path 'parameter seems to be redundant, I deleted it)

feinanshan commented 4 years ago

Hi please try with replacing ''state = torch.load(cfg["validating"]["resume"])#["model_state"]'' into ''state = torch.load(cfg["validating"]["resume"])["model_state"]'' in line https://github.com/feinanshan/TDNet/blob/master/Training/validate.py#L52

872544050 commented 4 years ago

Dear author, thank you very much for your advice. There is no problem in the verification phase. However, the same weight is in the test.py There are still problems in the document: Loading pretrained model from './checkpoint/td4_psp_cityscapes_best_model.pkl' Traceback (most recent call last): File "/mnt/guhuayu/TDNetxin/Testing/test.py", line 95, in test(args) File "/mnt/guhuayu/TDNetxin/Testing/test.py", line 27, in test model = td4_psp18.td4_psp18(nclass=19,path_num=4,model_path=args._td4_psp18_path) File "/mnt/guhuayu/TDNetxin/Testing/model/pspnet/td4_psp18.py", line 117, in init self.pretrained_mp_load() File "/mnt/guhuayu/TDNetxin/Testing/model/pspnet/td4_psp18.py", line 237, in pretrained_mp_load self.load_state_dict(model_state, strict=True) File "/home/usv/anaconda3/envs/TDnet1/lib/python3.7/site-packages/torch/nn/modules/module.py", line 777, in load_state_dict self.class.name, "\n\t".join(error_msgs))) RuntimeError: Error(s) in loading state_dict for td4_psp18: Missing key(s) in state_dict: "pretrained1.conv1.weight", "pretrained1.bn1.weight", "pretrained1.bn1.bias", "pretrained1.bn1.running_mean", "pretrained1.bn1.running_var", "pretrained1.layer1.0.conv1.weight", "pretrained1.layer1.0.bn1.weight", "pretrained1.layer1.0.bn1.bias", "pretrained1.layer1.0.bn1.running_mean", "pretrained1.layer1.0.bn1.running_var", "pretrained1.layer1.0.conv2.weight", "pretrained1.layer1.0.bn2.weight", "pretrained1.layer1.0.bn2.bias", "pretrained1.layer1.0.bn2.running_mean", "pretrained1.layer1.0.bn2.running_var", "pretrained1.layer1.1.conv1.weight", "pretrained1.layer1.1.bn1.weight", "pretrained1.layer1.1.bn1.bias", "pretrained1.layer1.1.bn1.running_mean", "pretrained1.layer1.1.bn1.running_var", "pretrained1.layer1.1.conv2.weight", "pretrained1.layer1.1.bn2.weight", "pretrained1.layer1.1.bn2.bias", "pretrained1.layer1.1.bn2.running_mean", "pretrained1.layer1.1.bn2.running_var", "pretrained1.layer2.0.conv1.weight", "pretrained1.layer2.0.bn1.weight", "pretrained1.layer2.0.bn1.bias", "pretrained1.layer2.0.bn1.running_mean", "pretrained1.layer2.0.bn1.running_var", "pretrained1.layer2.0.conv2.weight", "pretrained1.layer2.0.bn2.weight", "pretrained1.layer2.0.bn2.bias", "pretrained1.layer2.0.bn2.running_mean", "pretrained1.layer2.0.bn2.running_var", "pretrained1.layer2.0.downsample.0.weight", "pretrained1.layer2.0.downsample.1.weight", "pretrained1.layer2.0.downsample.1.bias", "pretrained1.layer2.0.downsample.1.running_mean", "pretrained1.layer2.0.downsample.1.running_var", "pretrained1.layer2.1.conv1.weight", "pretrained1.layer2.1.bn1.weight", "pretrained1.layer2.1.bn1.bias", "pretrained1.layer2.1.bn1.running_mean", "pretrained1.layer2.1.bn1.running_var", "pretrained1.layer2.1.conv2.weight", "pretrained1.layer2.1.bn2.weight", "pretrained1.layer2.1.bn2.bias", "pretrained1.layer2.1.bn2.running_mean", "pretrained1.layer2.1.bn2.running_var", "pretrained1.layer3.0.conv1.weight", "pretrained1.layer3.0.bn1.weight", "pretrained1.layer3.0.bn1.bias", "pretrained1.layer3.0.bn1.running_mean", "pretrained1.layer3.0.bn1.running_var", "pretrained1.layer3.0.conv2.weight", "pretrained1.layer3.0.bn2.weight", "pretrained1.layer3.0.bn2.bias", "pretrained1.layer3.0.bn2.running_mean", "pretrained1.layer3.0.bn2.running_var", "pretrained1.layer3.0.downsample.0.weight", "pretrained1.layer3.0.downsample.1.weight", "pretrained1.layer3.0.downsample.1.bias", "pretrained1.layer3.0.downsample.1.running_mean", "pretrained1.layer3.0.downsample.1.running_var", "pretrained1.layer3.1.conv1.weight", "pretrained1.layer3.1.bn1.weight", "pretrained1.layer3.1.bn1.bias", "pretrained1.layer3.1.bn1.running_mean", "pretrained1.layer3.1.bn1.running_var", "pretrained1.layer3.1.conv2.weight", "pretrained1.layer3.1.bn2.weight", "pretrained1.layer3.1.bn2.bias", "pretrained1.layer3.1.bn2.running_mean", "pretrained1.layer3.1.bn2.running_var", "pretrained1.layer4.0.conv1.weight", "pretrained1.layer4.0.bn1.weight", "pretrained1.layer4.0.bn1.bias", "pretrained1.layer4.0.bn1.running_mean", "pretrained1.layer4.0.bn1.running_var", "pretrained1.layer4.0.conv2.weight", "pretrained1.layer4.0.bn2.weight", "pretrained1.layer4.0.bn2.bias", "pretrained1.layer4.0.bn2.running_mean", "pretrained1.layer4.0.bn2.running_var", "pretrained1.layer4.0.downsample.0.weight", "pretrained1.layer4.0.downsample.1.weight", "pretrained1.layer4.0.downsample.1.bias", "pretrained1.layer4.0.downsample.1.running_mean", "pretrained1.layer4.0.downsample.1.running_var", "pretrained1.layer4.1.conv1.weight", "pretrained1.layer4.1.bn1.weight", "pretrained1.layer4.1.bn1.bias", "pretrained1.layer4.1.bn1.running_mean", "pretrained1.layer4.1.bn1.running_var", "pretrained1.layer4.1.conv2.weight", "pretrained1.layer4.1.bn2.weight", "pretrained1.layer4.1.bn2.bias", "pretrained1.layer4.1.bn2.running_mean", "pretrained1.layer4.1.bn2.running_var", "pretrained1.fc.weight", "pretrained1.fc.bias", "pretrained2.conv1.weight", "pretrained2.bn1.weight", "pretrained2.bn1.bias", "pretrained2.bn1.running_mean", "pretrained2.bn1.running_var", "pretrained2.layer1.0.conv1.weight", "pretrained2.layer1.0.bn1.weight", "pretrained2.layer1.0.bn1.bias", "pretrained2.layer1.0.bn1.running_mean", "pretrained2.layer1.0.bn1.running_var", "pretrained2.layer1.0.conv2.weight", "pretrained2.layer1.0.bn2.weight", "pretrained2.layer1.0.bn2.bias", "pretrained2.layer1.0.bn2.running_mean", "pretrained2.layer1.0.bn2.running_var", "pretrained2.layer1.1.conv1.weight", "pretrained2.layer1.1.bn1.weight", "pretrained2.layer1.1.bn1.bias", "pretrained2.layer1.1.bn1.running_mean", "pretrained2.layer1.1.bn1.running_var", "pretrained2.layer1.1.conv2.weight", "pretrained2.layer1.1.bn2.weight", "pretrained2.layer1.1.bn2.bias", "pretrained2.layer1.1.bn2.running_mean", "pretrained2.layer1.1.bn2.running_var", "pretrained2.layer2.0.conv1.weight", "pretrained2.layer2.0.bn1.weight", "pretrained2.layer2.0.bn1.bias", "pretrained2.layer2.0.bn1.running_mean", "pretrained2.layer2.0.bn1.running_var", "pretrained2.layer2.0.conv2.weight", "pretrained2.layer2.0.bn2.weight", "pretrained2.layer2.0.bn2.bias", "pretrained2.layer2.0.bn2.running_mean", "pretrained2.layer2.0.bn2.running_var", "pretrained2.layer2.0.downsample.0.weight", "pretrained2.layer2.0.downsample.1.weight", "pretrained2.layer2.0.downsample.1.bias", "pretrained2.layer2.0.downsample.1.running_mean", "pretrained2.layer2.0.downsample.1.running_var", "pretrained2.layer2.1.conv1.weight", "pretrained2.layer2.1.bn1.weight", "pretrained2.layer2.1.bn1.bias", "pretrained2.layer2.1.bn1.running_mean", "pretrained2.layer2.1.bn1.running_var", "pretrained2.layer2.1.conv2.weight", "pretrained2.layer2.1.bn2.weight", "pretrained2.layer2.1.bn2.bias", "pretrained2.layer2.1.bn2.running_mean", "pretrained2.layer2.1.bn2.running_var", "pretrained2.layer3.0.conv1.weight", "pretrained2.layer3.0.bn1.weight", "pretrained2.layer3.0.bn1.bias", "pretrained2.layer3.0.bn1.running_mean", "pretrained2.layer3.0.bn1.running_var", "pretrained2.layer3.0.conv2.weight", "pretrained2.layer3.0.bn2.weight", "pretrained2.layer3.0.bn2.bias", "pretrained2.layer3.0.bn2.running_mean", "pretrained2.layer3.0.bn2.running_var", "pretrained2.layer3.0.downsample.0.weight", "pretrained2.layer3.0.downsample.1.weight", "pretrained2.layer3.0.downsample.1.bias", "pretrained2.layer3.0.downsample.1.running_mean", "pretrained2.layer3.0.downsample.1.running_var", "pretrained2.layer3.1.conv1.weight", "pretrained2.layer3.1.bn1.weight", "pretrained2.layer3.1.bn1.bias", "pretrained2.layer3.1.bn1.running_mean", "pretrained2.layer3.1.bn1.running_var", "pretrained2.layer3.1.conv2.weight", "pretrained2.layer3.1.bn2.weight", "pretrained2.layer3.1.bn2.bias", "pretrained2.layer3.1.bn2.running_mean", "pretrained2.layer3.1.bn2.running_var", "pretrained2.layer4.0.conv1.weight", "pretrained2.layer4.0.bn1.weight", "pretrained2.layer4.0.bn1.bias", "pretrained2.layer4.0.bn1.running_mean", "pretrained2.layer4.0.bn1.running_var", "pretrained2.layer4.0.conv2.weight", "pretrained2.layer4.0.bn2.weight", "pretrained2.layer4.0.bn2.bias", "pretrained2.layer4.0.bn2.running_mean", "pretrained2.layer4.0.bn2.running_var", "pretrained2.layer4.0.downsample.0.weight", "pretrained2.layer4.0.downsample.1.weight", "pretrained2.layer4.0.downsample.1.bias", "pretrained2.layer4.0.downsample.1.running_mean", "pretrained2.layer4.0.downsample.1.running_var", "pretrained2.layer4.1.conv1.weight", "pretrained2.layer4.1.bn1.weight", "pretrained2.layer4.1.bn1.bias", "pretrained2.layer4.1.bn1.running_mean", "pretrained2.layer4.1.bn1.running_var", "pretrained2.layer4.1.conv2.weight", "pretrained2.layer4.1.bn2.weight", "pretrained2.layer4.1.bn2.bias", "pretrained2.layer4.1.bn2.running_mean", "pretrained2.layer4.1.bn2.running_var", "pretrained2.fc.weight", "pretrained2.fc.bias", "pretrained3.conv1.weight", "pretrained3.bn1.weight", "pretrained3.bn1.bias", "pretrained3.bn1.running_mean", "pretrained3.bn1.running_var", "pretrained3.layer1.0.conv1.weight", "pretrained3.layer1.0.bn1.weight", "pretrained3.layer1.0.bn1.bias", "pretrained3.layer1.0.bn1.running_mean", "pretrained3.layer1.0.bn1.running_var", "pretrained3.layer1.0.conv2.weight", "pretrained3.layer1.0.bn2.weight", "pretrained3.layer1.0.bn2.bias", "pretrained3.layer1.0.bn2.running_mean", "pretrained3.layer1.0.bn2.running_var", "pretrained3.layer1.1.conv1.weight", "pretrained3.layer1.1.bn1.weight", "pretrained3.layer1.1.bn1.bias", "pretrained3.layer1.1.bn1.running_mean", "pretrained3.layer1.1.bn1.running_var", "pretrained3.layer1.1.conv2.weight", "pretrained3.layer1.1.bn2.weight", "pretrained3.layer1.1.bn2.bias", "pretrained3.layer1.1.bn2.running_mean", "pretrained3.layer1.1.bn2.running_var", "pretrained3.layer2.0.conv1.weight", "pretrained3.layer2.0.bn1.weight", "pretrained3.layer2.0.bn1.bias", "pretrained3.layer2.0.bn1.running_mean", "pretrained3.layer2.0.bn1.running_var", "pretrained3.layer2.0.conv2.weight", "pretrained3.layer2.0.bn2.weight", "pretrained3.layer2.0.bn2.bias", "pretrained3.layer2.0.bn2.running_mean", "pretrained3.layer2.0.bn2.running_var", "pretrained3.layer2.0.downsample.0.weight", "pretrained3.layer2.0.downsample.1.weight", "pretrained3.layer2.0.downsample.1.bias", "pretrained3.layer2.0.downsample.1.running_mean", "pretrained3.layer2.0.downsample.1.running_var", "pretrained3.layer2.1.conv1.weight", "pretrained3.layer2.1.bn1.weight", "pretrained3.layer2.1.bn1.bias", "pretrained3.layer2.1.bn1.running_mean", "pretrained3.layer2.1.bn1.running_var", "pretrained3.layer2.1.conv2.weight", "pretrained3.layer2.1.bn2.weight", "pretrained3.layer2.1.bn2.bias", "pretrained3.layer2.1.bn2.running_mean", "pretrained3.layer2.1.bn2.running_var", "pretrained3.layer3.0.conv1.weight", "pretrained3.layer3.0.bn1.weight", "pretrained3.layer3.0.bn1.bias", "pretrained3.layer3.0.bn1.running_mean", "pretrained3.layer3.0.bn1.running_var", "pretrained3.layer3.0.conv2.weight", "pretrained3.layer3.0.bn2.weight", "pretrained3.layer3.0.bn2.bias", "pretrained3.layer3.0.bn2.running_mean", "pretrained3.layer3.0.bn2.running_var", "pretrained3.layer3.0.downsample.0.weight", "pretrained3.layer3.0.downsample.1.weight", "pretrained3.layer3.0.downsample.1.bias", "pretrained3.layer3.0.downsample.1.running_mean", "pretrained3.layer3.0.downsample.1.running_var", "pretrained3.layer3.1.conv1.weight", "pretrained3.layer3.1.bn1.weight", "pretrained3.layer3.1.bn1.bias", "pretrained3.layer3.1.bn1.running_mean", "pretrained3.layer3.1.bn1.running_var", "pretrained3.layer3.1.conv2.weight", "pretrained3.layer3.1.bn2.weight", "pretrained3.layer3.1.bn2.bias", "pretrained3.layer3.1.bn2.running_mean", "pretrained3.layer3.1.bn2.running_var", "pretrained3.layer4.0.conv1.weight", "pretrained3.layer4.0.bn1.weight", "pretrained3.layer4.0.bn1.bias", "pretrained3.layer4.0.bn1.running_mean", "pretrained3.layer4.0.bn1.running_var", "pretrained3.layer4.0.conv2.weight", "pretrained3.layer4.0.bn2.weight", "pretrained3.layer4.0.bn2.bias", "pretrained3.layer4.0.bn2.running_mean", "pretrained3.layer4.0.bn2.running_var", "pretrained3.layer4.0.downsample.0.weight", "pretrained3.layer4.0.downsample.1.weight", "pretrained3.layer4.0.downsample.1.bias", "pretrained3.layer4.0.downsample.1.running_mean", "pretrained3.layer4.0.downsample.1.running_var", "pretrained3.layer4.1.conv1.weight", "pretrained3.layer4.1.bn1.weight", "pretrained3.layer4.1.bn1.bias", "pretrained3.layer4.1.bn1.running_mean", "pretrained3.layer4.1.bn1.running_var", "pretrained3.layer4.1.conv2.weight", "pretrained3.layer4.1.bn2.weight", "pretrained3.layer4.1.bn2.bias", "pretrained3.layer4.1.bn2.running_mean", "pretrained3.layer4.1.bn2.running_var", "pretrained3.fc.weight", "pretrained3.fc.bias", "pretrained4.conv1.weight", "pretrained4.bn1.weight", "pretrained4.bn1.bias", "pretrained4.bn1.running_mean", "pretrained4.bn1.running_var", "pretrained4.layer1.0.conv1.weight", "pretrained4.layer1.0.bn1.weight", "pretrained4.layer1.0.bn1.bias", "pretrained4.layer1.0.bn1.running_mean", "pretrained4.layer1.0.bn1.running_var", "pretrained4.layer1.0.conv2.weight", "pretrained4.layer1.0.bn2.weight", "pretrained4.layer1.0.bn2.bias", "pretrained4.layer1.0.bn2.running_mean", "pretrained4.layer1.0.bn2.running_var", "pretrained4.layer1.1.conv1.weight", "pretrained4.layer1.1.bn1.weight", "pretrained4.layer1.1.bn1.bias", "pretrained4.layer1.1.bn1.running_mean", "pretrained4.layer1.1.bn1.running_var", "pretrained4.layer1.1.conv2.weight", "pretrained4.layer1.1.bn2.weight", "pretrained4.layer1.1.bn2.bias", "pretrained4.layer1.1.bn2.running_mean", "pretrained4.layer1.1.bn2.running_var", "pretrained4.layer2.0.conv1.weight", "pretrained4.layer2.0.bn1.weight", "pretrained4.layer2.0.bn1.bias", "pretrained4.layer2.0.bn1.running_mean", "pretrained4.layer2.0.bn1.running_var", "pretrained4.layer2.0.conv2.weight", "pretrained4.layer2.0.bn2.weight", "pretrained4.layer2.0.bn2.bias", "pretrained4.layer2.0.bn2.running_mean", "pretrained4.layer2.0.bn2.running_var", "pretrained4.layer2.0.downsample.0.weight", "pretrained4.layer2.0.downsample.1.weight", "pretrained4.layer2.0.downsample.1.bias", "pretrained4.layer2.0.downsample.1.running_mean", "pretrained4.layer2.0.downsample.1.running_var", "pretrained4.layer2.1.conv1.weight", "pretrained4.layer2.1.bn1.weight", "pretrained4.layer2.1.bn1.bias", "pretrained4.layer2.1.bn1.running_mean", "pretrained4.layer2.1.bn1.running_var", "pretrained4.layer2.1.conv2.weight", "pretrained4.layer2.1.bn2.weight", "pretrained4.layer2.1.bn2.bias", "pretrained4.layer2.1.bn2.running_mean", "pretrained4.layer2.1.bn2.running_var", "pretrained4.layer3.0.conv1.weight", "pretrained4.layer3.0.bn1.weight", "pretrained4.layer3.0.bn1.bias", "pretrained4.layer3.0.bn1.running_mean", "pretrained4.layer3.0.bn1.running_var", "pretrained4.layer3.0.conv2.weight", "pretrained4.layer3.0.bn2.weight", "pretrained4.layer3.0.bn2.bias", "pretrained4.layer3.0.bn2.running_mean", "pretrained4.layer3.0.bn2.running_var", "pretrained4.layer3.0.downsample.0.weight", "pretrained4.layer3.0.downsample.1.weight", "pretrained4.layer3.0.downsample.1.bias", "pretrained4.layer3.0.downsample.1.running_mean", "pretrained4.layer3.0.downsample.1.running_var", "pretrained4.layer3.1.conv1.weight", "pretrained4.layer3.1.bn1.weight", "pretrained4.layer3.1.bn1.bias", "pretrained4.layer3.1.bn1.running_mean", "pretrained4.layer3.1.bn1.running_var", "pretrained4.layer3.1.conv2.weight", "pretrained4.layer3.1.bn2.weight", "pretrained4.layer3.1.bn2.bias", "pretrained4.layer3.1.bn2.running_mean", "pretrained4.layer3.1.bn2.running_var", "pretrained4.layer4.0.conv1.weight", "pretrained4.layer4.0.bn1.weight", "pretrained4.layer4.0.bn1.bias", "pretrained4.layer4.0.bn1.running_mean", "pretrained4.layer4.0.bn1.running_var", "pretrained4.layer4.0.conv2.weight", "pretrained4.layer4.0.bn2.weight", "pretrained4.layer4.0.bn2.bias", "pretrained4.layer4.0.bn2.running_mean", "pretrained4.layer4.0.bn2.running_var", "pretrained4.layer4.0.downsample.0.weight", "pretrained4.layer4.0.downsample.1.weight", "pretrained4.layer4.0.downsample.1.bias", "pretrained4.layer4.0.downsample.1.running_mean", "pretrained4.layer4.0.downsample.1.running_var", "pretrained4.layer4.1.conv1.weight", "pretrained4.layer4.1.bn1.weight", "pretrained4.layer4.1.bn1.bias", "pretrained4.layer4.1.bn1.running_mean", "pretrained4.layer4.1.bn1.running_var", "pretrained4.layer4.1.conv2.weight", "pretrained4.layer4.1.bn2.weight", "pretrained4.layer4.1.bn2.bias", "pretrained4.layer4.1.bn2.running_mean", "pretrained4.layer4.1.bn2.running_var", "pretrained4.fc.weight", "pretrained4.fc.bias", "psp1.conv1.0.weight", "psp1.conv1.1.weight", "psp1.conv1.1.bias", "psp1.conv1.1.running_mean", "psp1.conv1.1.running_var", "psp1.conv2.0.weight", "psp1.conv2.1.weight", "psp1.conv2.1.bias", "psp1.conv2.1.running_mean", "psp1.conv2.1.running_var", "psp1.conv3.0.weight", "psp1.conv3.1.weight", "psp1.conv3.1.bias", "psp1.conv3.1.running_mean", "psp1.conv3.1.running_var", "psp1.conv4.0.weight", "psp1.conv4.1.weight", "psp1.conv4.1.bias", "psp1.conv4.1.running_mean", "psp1.conv4.1.running_var", "psp2.conv1.0.weight", "psp2.conv1.1.weight", "psp2.conv1.1.bias", "psp2.conv1.1.running_mean", "psp2.conv1.1.running_var", "psp2.conv2.0.weight", "psp2.conv2.1.weight", "psp2.conv2.1.bias", "psp2.conv2.1.running_mean", "psp2.conv2.1.running_var", "psp2.conv3.0.weight", "psp2.conv3.1.weight", "psp2.conv3.1.bias", "psp2.conv3.1.running_mean", "psp2.conv3.1.running_var", "psp2.conv4.0.weight", "psp2.conv4.1.weight", "psp2.conv4.1.bias", "psp2.conv4.1.running_mean", "psp2.conv4.1.running_var", "psp3.conv1.0.weight", "psp3.conv1.1.weight", "psp3.conv1.1.bias", "psp3.conv1.1.running_mean", "psp3.conv1.1.running_var", "psp3.conv2.0.weight", "psp3.conv2.1.weight", "psp3.conv2.1.bias", "psp3.conv2.1.running_mean", "psp3.conv2.1.running_var", "psp3.conv3.0.weight", "psp3.conv3.1.weight", "psp3.conv3.1.bias", "psp3.conv3.1.running_mean", "psp3.conv3.1.running_var", "psp3.conv4.0.weight", "psp3.conv4.1.weight", "psp3.conv4.1.bias", "psp3.conv4.1.running_mean", "psp3.conv4.1.running_var", "psp4.conv1.0.weight", "psp4.conv1.1.weight", "psp4.conv1.1.bias", "psp4.conv1.1.running_mean", "psp4.conv1.1.running_var", "psp4.conv2.0.weight", "psp4.conv2.1.weight", "psp4.conv2.1.bias", "psp4.conv2.1.running_mean", "psp4.conv2.1.running_var", "psp4.conv3.0.weight", "psp4.conv3.1.weight", "psp4.conv3.1.bias", "psp4.conv3.1.running_mean", "psp4.conv3.1.running_var", "psp4.conv4.0.weight", "psp4.conv4.1.weight", "psp4.conv4.1.bias", "psp4.conv4.1.running_mean", "psp4.conv4.1.running_var", "enc1.w_qs.0.conv.weight", "enc1.w_qs.0.conv.bias", "enc1.w_qs.0.bn.weight", "enc1.w_qs.0.bn.bias", "enc1.w_qs.0.bn.running_mean", "enc1.w_qs.0.bn.running_var", "enc1.w_qs.1.conv.weight", "enc1.w_qs.1.conv.bias", "enc1.w_ks.0.conv.weight", "enc1.w_ks.0.conv.bias", "enc1.w_ks.0.bn.weight", "enc1.w_ks.0.bn.bias", "enc1.w_ks.0.bn.running_mean", "enc1.w_ks.0.bn.running_var", "enc1.w_ks.1.conv.weight", "enc1.w_ks.1.conv.bias", "enc1.w_vs.0.conv.weight", "enc1.w_vs.0.conv.bias", "enc2.w_qs.0.conv.weight", "enc2.w_qs.0.conv.bias", "enc2.w_qs.0.bn.weight", "enc2.w_qs.0.bn.bias", "enc2.w_qs.0.bn.running_mean", "enc2.w_qs.0.bn.running_var", "enc2.w_qs.1.conv.weight", "enc2.w_qs.1.conv.bias", "enc2.w_ks.0.conv.weight", "enc2.w_ks.0.conv.bias", "enc2.w_ks.0.bn.weight", "enc2.w_ks.0.bn.bias", "enc2.w_ks.0.bn.running_mean", "enc2.w_ks.0.bn.running_var", "enc2.w_ks.1.conv.weight", "enc2.w_ks.1.conv.bias", "enc2.w_vs.0.conv.weight", "enc2.w_vs.0.conv.bias", "enc3.w_qs.0.conv.weight", "enc3.w_qs.0.conv.bias", "enc3.w_qs.0.bn.weight", "enc3.w_qs.0.bn.bias", "enc3.w_qs.0.bn.running_mean", "enc3.w_qs.0.bn.running_var", "enc3.w_qs.1.conv.weight", "enc3.w_qs.1.conv.bias", "enc3.w_ks.0.conv.weight", "enc3.w_ks.0.conv.bias", "enc3.w_ks.0.bn.weight", "enc3.w_ks.0.bn.bias", "enc3.w_ks.0.bn.running_mean", "enc3.w_ks.0.bn.running_var", "enc3.w_ks.1.conv.weight", "enc3.w_ks.1.conv.bias", "enc3.w_vs.0.conv.weight", "enc3.w_vs.0.conv.bias", "enc4.w_qs.0.conv.weight", "enc4.w_qs.0.conv.bias", "enc4.w_qs.0.bn.weight", "enc4.w_qs.0.bn.bias", "enc4.w_qs.0.bn.running_mean", "enc4.w_qs.0.bn.running_var", "enc4.w_qs.1.conv.weight", "enc4.w_qs.1.conv.bias", "enc4.w_ks.0.conv.weight", "enc4.w_ks.0.conv.bias", "enc4.w_ks.0.bn.weight", "enc4.w_ks.0.bn.bias", "enc4.w_ks.0.bn.running_mean", "enc4.w_ks.0.bn.running_var", "enc4.w_ks.1.conv.weight", "enc4.w_ks.1.conv.bias", "enc4.w_vs.0.conv.weight", "enc4.w_vs.0.conv.bias", "atn1_2.fc.0.conv.weight", "atn1_2.fc.0.conv.bias", "atn1_3.fc.0.conv.weight", "atn1_3.fc.0.conv.bias", "atn1_4.fc.0.conv.weight", "atn1_4.fc.0.conv.bias", "atn2_1.fc.0.conv.weight", "atn2_1.fc.0.conv.bias", "atn2_3.fc.0.conv.weight", "atn2_3.fc.0.conv.bias", "atn2_4.fc.0.conv.weight", "atn2_4.fc.0.conv.bias", "atn3_1.fc.0.conv.weight", "atn3_1.fc.0.conv.bias", "atn3_2.fc.0.conv.weight", "atn3_2.fc.0.conv.bias", "atn3_4.fc.0.conv.weight", "atn3_4.fc.0.conv.bias", "atn4_1.fc.0.conv.weight", "atn4_1.fc.0.conv.bias", "atn4_2.fc.0.conv.weight", "atn4_2.fc.0.conv.bias", "atn4_3.fc.0.conv.weight", "atn4_3.fc.0.conv.bias", "layer_norm1.ln.weight", "layer_norm1.ln.bias", "layer_norm2.ln.weight", "layer_norm2.ln.bias", "layer_norm3.ln.weight", "layer_norm3.ln.bias", "layer_norm4.ln.weight", "layer_norm4.ln.bias", "head1.conv5.0.weight", "head1.conv5.1.weight", "head1.conv5.1.bias", "head1.conv5.1.running_mean", "head1.conv5.1.running_var", "head1.conv5.4.weight", "head1.conv5.4.bias", "head2.conv5.0.weight", "head2.conv5.1.weight", "head2.conv5.1.bias", "head2.conv5.1.running_mean", "head2.conv5.1.running_var", "head2.conv5.4.weight", "head2.conv5.4.bias", "head3.conv5.0.weight", "head3.conv5.1.weight", "head3.conv5.1.bias", "head3.conv5.1.running_mean", "head3.conv5.1.running_var", "head3.conv5.4.weight", "head3.conv5.4.bias", "head4.conv5.0.weight", "head4.conv5.1.weight", "head4.conv5.1.bias", "head4.conv5.1.running_mean", "head4.conv5.1.running_var", "head4.conv5.4.weight", "head4.conv5.4.bias". Unexpected key(s) in state_dict: "epoch", "model_state", "best_iou".

Kindly give me your advice.

feinanshan commented 4 years ago

Hi at this line of code https://github.com/feinanshan/TDNet/blob/3f8b5378fcc7f97c26b3760ddaf3d4402cf477d1/Testing/model/pspnet/td2_psp50.py#L162 Change ''model_state = torch.load(self.psp_path)'' into ''model_state = torch.load(self.psp_path)["model_state"]''

or you can extract the item "model_state" from the trained model, and save it as a unique file for the verification phase.

872544050 commented 4 years ago

Dear author, thank you very much for your advice.

AlbertHuyb commented 2 years ago

Dear author, thank you very much for your advice.

Did you reproduce the validation result on cityscapes?

I use the provided td4-psp18 model, but only get around 50% mIoU on val data.

AvrahamRaviv commented 1 year ago

Dear author, thank you very much for your advice.

Did you reproduce the validation result on cityscapes?

I use the provided td4-psp18 model, but only get around 50% mIoU on val data.

Hi, here the same... Could you reproduce? Thanks