mit-han-lab / bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
https://bevfusion.mit.edu
Apache License 2.0
2.22k stars 405 forks source link

Inconsistent reproducing results on the nuscene dataset! #630

Closed zyqww closed 1 month ago

zyqww commented 1 month ago

I downloaded the latest code and checkpoint files When testing the Lidar-only and Camera-only models, the results obtained were mAP=0.6468, NDS=0.6924 and mAP=0.3554, NDS=0.4121, which are very similar to the paper results, using the commands respectively:

zyqww commented 1 month ago

It is worth mentioning that I can only train on one RTX3090 now, samples_per_gpu and workers_per_gpu are set to 2 and 0 respectively, and there will be an error when training with the original parameters, that is "ValueError: matrix contains invalid numeric entries". I will try to change the learning rate for this problem, but I want to ask if this training method is feasible at present? In addition, the training time is also very long, how can I change it to train on four RTX3090?

zhijian-liu commented 1 month ago

Thank you for your interest in our project. This repository is no longer actively maintained, so we will be closing this issue. Please refer to the amazing implementation at MMDetection3D. Thank you again!