Closed klightz closed 2 years ago
Hi,
I think there are several potential reasons that might cause this issue.
The first is that did all weights are loaded properly from the checkpoint? Because the share between different spconv version might be different, caused from the Algo of convolution layers. There should be warnings reports on the testing log, if this is the reason. But I think the probability of this is small. Because the performance should be much lower than this, not only 4 points.
The second potential reason might be from the dataset info files. They should be regenerated to include the image parts. Would you please provide your "infos_val_10sweeps_withvelo_filter_True.pkl" file? I will check it on our machine. In addition, this is the file I use. You can check it on yours.
The third potential reason is that I change this file a week ago, commit. I am not sure whether this commit will cause this issue. It seems that this should not be relavant. You can roll it back for checking on the testing only.
In addition, the spconv version we have tested on is 2.1.21 and before. I am not so sure whether the update on spconv 2.1.22 causes this issue.
You can contact me at the WeChat 13261057196 to have a more frequent discussion, in case that I am not alway checking the GitHub issues in time.
Thanks a lot. I agree that the second one might be the possible reason since I do not make any specific change for that file after running the script from CenterPoint. Here is my val pkl file Google Drive.
BTW, I can not see your URL for infos_val_10sweeps_withvelo_filter_True.pkl from the above message, maybe you could have a double check?
I may also try to downgrade the spconv for a try. Thanks a lot for the quick reply.
Sorry. I am trying to upload it to the google drive. But the speed is a bit slow. Hope for your understanding.
Of course, take your time, just to ensure I do not miss something.
Thanks for pointing out this issue. Note that this is fixed by this commit.
Hi klightz, I have faced the same issue like you. Did you solve the problem? Was it nfos_val_10sweeps_withvelo_filter_True.pkl lack of image info that causes this gap? Or different version of spconv? Looking forward to your reply, thanks a lot! : D
Hello, sorry for another question, recently I try to train the model by myself, and I get a 4-5 points lower result than the README report. So I try to load the checkpoint you provide first for a valid test.
However, I found that the checkpoint I loaded also in a lower performance. I also tried to setup the environment independently on my another machine and directly do the checkpoint test, still get a worse result, same number as my previous one. I suspect it may caused by some package version mismatch, or some api behavior. Any Idea about this?
I think I exactly follow the guideline for the dataset and codebase setup. For the evaluation command, I follow the README and run:
Here are my key packages version:
The output of the prediction:
So you can see i get a 60.3 on the full dataset, and I get 56 map on 1/4 dataset.
BTW, I also check the dataset correctness by observing the behavior of the CenterPoint performance. I load the checkpoint under this setup centerpoint_voxel_1440 and I can exactly get the 59.6 mAP they as report. So i think something goes wrong with the image fusion part. Any Idea about this issue? Really thanks a lot!