Open ykqyzzs opened 1 year ago
@ykqyzzs Hi, thanks for your feedback. We also found this bug and we're troubleshooting these days. I have found that the test of MVXNet had some problems. We'll continue to find out. If you have some new discoveries, feel free to post them here.
您好,感谢您的反馈。我们还发现了此错误,这些天我们正在进行故障排除。我发现MVXNet的测试存在一些问题。我们将继续寻找答案。如果您有一些新发现,请随时在此处发布。
I have run the training and test again these days, but the problem of low map value still occurs. May I ask if this problem has been fixed now?
@JingweiZhang12 Is there any update on that?
@ykqyzzs您好,感谢您的反馈。我们也发现了这个错误,这些天我们正在排除故障。我发现MVXNet的测试有一些问题。我们将继续寻找答案。如果您有一些新发现,请随时发布在这里。 @JingweiZhang12 Please, is it fixed? The training was fine before in August, there was a weights file and the test results were all there. But recently, I have trained and tested again and it has always been 0. I am surprised, I took the previous weights file and tested it and the results were normal, so why does it not work again only training the new one? I have not changed the code in any way. Just trained a different number of cycles. Looking forward to your response, Thank you
Thanks for your error report and we appreciate it a lot.
Checklist
Describe the bug Hi, I tested some multimodal model recently, tried to test the mAP of MVXNet on KITTI but got too low mAP after 10 epoch(load the checkpoint by http backend from path provided in config file as pretrained model, chage lr to 0.0001. Actually after 60 epochs still have no improve...) :
----------- AP40 Results ------------
Pedestrian AP40@0.50, 0.50, 0.50: bbox AP40:0.3481, 0.4552, 0.3718 bev AP40:0.0165, 0.0198, 0.0207 3d AP40:0.0192, 0.0150, 0.0161 aos AP40:0.20, 0.26, 0.21 Pedestrian AP40@0.50, 0.25, 0.25: bbox AP40:0.3481, 0.4552, 0.3718 bev AP40:0.2352, 0.2570, 0.1928 3d AP40:0.2070, 0.1641, 0.1738 aos AP40:0.20, 0.26, 0.21 Cyclist AP40@0.50, 0.50, 0.50: bbox AP40:0.0083, 0.0115, 0.0115 bev AP40:0.0000, 0.0000, 0.0000 3d AP40:0.0000, 0.0000, 0.0000 aos AP40:0.00, 0.00, 0.00 Cyclist AP40@0.50, 0.25, 0.25: bbox AP40:0.0083, 0.0115, 0.0115 bev AP40:0.0052, 0.0089, 0.0089 3d AP40:0.0000, 0.0067, 0.0067 aos AP40:0.00, 0.00, 0.00 Car AP40@0.70, 0.70, 0.70: bbox AP40:0.1132, 0.0963, 0.1418 bev AP40:0.0178, 0.0181, 0.0227 3d AP40:0.0126, 0.0098, 0.0109 aos AP40:0.08, 0.06, 0.08 Car AP40@0.70, 0.50, 0.50: bbox AP40:0.1132, 0.0963, 0.1418 bev AP40:0.2990, 0.1687, 0.1875 3d AP40:0.2177, 0.1190, 0.1433 aos AP40:0.08, 0.06, 0.08
Overall AP40@easy, moderate, hard: bbox AP40:0.1565, 0.1877, 0.1750 bev AP40:0.0114, 0.0126, 0.0145 3d AP40:0.0106, 0.0083, 0.0090 aos AP40:0.09, 0.11, 0.10
I noticed that running log shows: The model and loaded state dict do not match exactly: unexpected key in source state_dict: img_rpn_head.rpn_conv.weight, img_rpn_head.rpn_conv.bias, img_rpn_head.rpn_cls.weight, img_rpn_head.rpn_cls.bias, img_rpn_head.rpn_reg.weight, img_rpn_head.rpn_reg.bias, img_bbox_head.fc_cls.weight, img_bbox_head.fc_cls.bias, img_bbox_head.fc_reg.weight, img_bbox_head.fc_reg.bias, img_bbox_head.shared_fcs.0.weight, img_bbox_head.shared_fcs.0.bias, img_bbox_head.shared_fcs.1.weight, img_bbox_head.shared_fcs.1.bias
missing keys in source state_dict: pts_voxel_encoder.vfe_layers.0.0.weight, ......
In addition, if I annotate the code that loads pretrained weights and retrain it, I also got similar results after 40 epochs. I don't know if it's the reason caused my situation. could you please give me some advices?
Reproduction
Did you make any modifications on the code or config? Did you understand what you have modified? I haven't modify the code, I have modified the max_epochs and eval interval in configs/base/default_runtime.py
What dataset did you use? Official KITTI datasets.
Environment
python mmdet3d/utils/collect_env.py
to collect necessary environment information and paste it here.Error traceback the running log was:
Bug fix If you have already identified the reason, you can provide the information here. If you are willing to create a PR to fix it, please also leave a comment here and that would be much appreciated!