Real time implementation

DecaYale / RNNPose

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization, CVPR 2022

Apache License 2.0

160 stars 17 forks source link

Not very precise. For a single image, we only need one initial pose estimation to process. If you want to process a video, you could take the pose estimation from the previous frame t-1 to be the initial pose estimation of the frame t.
There are still many aspects where you could further improve the speed. For instance, 1) you could re-implement the rendering parts with cuda like RePose; 2) you could implement the LM-optimization with cuda. Also, you could decrease the recurrent iterations and rendering iterations to achieve a tradeoff between the speed and accuracy. Actually, fewer refinement iterations are still enough for most application cases.

DecaYale / RNNPose