ethnhe / PVN3D

Code for "PVN3D: A Deep Point-wise 3D Keypoints Hough Voting Network for 6DoF Pose Estimation", CVPR 2020
MIT License
489 stars 105 forks source link

Time and Space Complexity #10

Closed zx19950617 closed 4 years ago

zx19950617 commented 4 years ago

Thanks for your excellent work and sharing! Could you summarize space and time complexity of PVN3D, please? PVN3D seems to achieve real time performance, right? Any quantitative metric?

Besides, my training procedure is slightly inefficient. As exmaple, the training of --cls ape is conducted on two Tesla t4 GPUs, each with 16 G memory. Detailed training paremeters remain unchanged except the self.mini_batch_size and self.val_mini_batch_size (default 24 to 8). Unacceptably, the total training time may take around 10 days for just the class ape. So I'd like to confirm if any problems occured in my training stage. 图片 图片 图片

Thanks again! And great appreciation for any help!

ethnhe commented 4 years ago

During inference, it takes 0.17 seconds for network forward propagation, most of which spent on PointNet++, and 0.02 seconds for pose estimation of each object on an RTX 2080 Ti. The overall runtime is 5 FPS on the LineMOD dataset. It takes about 36 hours training the total 25 epochs with 8 RTX 2080 Ti. But you don't need to finish all the 25 epochs, it usually get a good result with about 9-10 epochs and best result with about 18-20 epochs.

zx19950617 commented 4 years ago

OK, thanks for your experience sharing and suggestion!
I got a general understanding about the training and inference of PVN3D.