klightz closed this issue 2 years ago
Thanks for your interest in our work.
This bug appears with PyTorch versions later than 1.7.0. I have just updated the code to fix it. I tested the updated code on PyTorch 1.11.0 and it runs. Please use the updated version of this file: https://github.com/dvlab-research/FocalsConv/blob/master/CenterPoint/det3d/models/fusion/voxel_with_point_projection.py
Thanks a lot, it works for me.
One quick additional question: I find that training on 1/4 of nuScenes with Focal-multimodal is much slower than CenterPoint (4x V100, 40 CPUs, 256 GB RAM). In particular, the forward and backward passes become extremely slow in some batches (it does not seem to be a data loading issue) and are normal in others.
Any idea what could be causing this? I need faster training more than the best performance, so I may modify the model slightly if possible. Thanks!
Would you please provide some logs or hints on this problem?
Never mind, it is probably a data loading problem. I am not sure why that time is counted toward the forward time. I will check in detail and temporarily close this issue. If I cannot solve it, I will paste some log information here. Thanks a lot for the quick reply!
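For anyone hitting the same confusion, here is a minimal sketch of one way to separate data-loading wait from forward/backward time in a generic Python training loop. The names `profile_loop`, `slow_iterations`, `step_fn`, and `sync_fn` are hypothetical and not part of FocalsConv or det3d. CUDA kernels are launched asynchronously, so unless something like `torch.cuda.synchronize` is called before reading the clock, time can be attributed to the wrong phase.

```python
import statistics
import time


def profile_loop(batches, step_fn, sync_fn=None):
    """Record, per iteration, the time spent waiting for data and in step_fn.

    batches: any iterable yielding batches (a DataLoader in a real run).
    step_fn: callable doing forward/backward for one batch (hypothetical).
    sync_fn: optional callable run before the clock is read after step_fn;
             on a GPU one might pass torch.cuda.synchronize (assumption).
    """
    records = []
    iterator = iter(batches)
    while True:
        t0 = time.perf_counter()
        try:
            batch = next(iterator)  # blocks while the loader prepares data
        except StopIteration:
            break
        t1 = time.perf_counter()
        step_fn(batch)
        if sync_fn is not None:
            sync_fn()  # wait for queued asynchronous work to finish
        t2 = time.perf_counter()
        records.append({"data_s": t1 - t0, "step_s": t2 - t1})
    return records


def slow_iterations(records, key, factor=3.0):
    """Return indices whose `key` time exceeds `factor` times the median."""
    median = statistics.median(r[key] for r in records)
    return [i for i, r in enumerate(records) if r[key] > factor * median]
```

In a real run, `batches` would be the training DataLoader and `step_fn` would wrap the forward pass, backward pass, and optimizer step. Comparing `slow_iterations(records, "data_s")` with `slow_iterations(records, "step_s")` shows which phase the slow batches fall into.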
Thanks for the information. Please feel free to reopen it.
Thanks. One more question regarding this issue: how much RAM and how many CPU cores are you using for the 4-GPU nuScenes training? It would help my time-bottleneck analysis.
We use 4 NVIDIA V100 GPUs and 32 CPU cores for training it.
Thanks for contributing this wonderful work.
Previously, when I ran Focal Conv on KITTI, everything was OK. However, when I try to train on nuScenes using nusc_centerpoint_voxelnet_0075voxel_fix_bn_z_focal, I get an error:
I also tried the plain CenterPoint VoxelNet config nusc_centerpoint_voxelnet_0075voxel_fix_bn_z.py in this repo, and it trains smoothly, so I guess the problem occurs in the Focal Conv layer. Any idea about this problem? Any hint or suggestion about where to look would also help. Thanks a lot.