Thank you for the excellent work! There is a problem I met when running on a custom dataset:
Your paper says"Given a monocular RGBD input video, along with a segmentation mask of the object of interest in the first frame only, our method tracks the 6-DoF pose of the object through subsequent frames and reconstructs a textured 3D model of the object." Why it asks for the second masks?(My data start from 000010.png)
Hi authors,
Thank you for the excellent work! There is a problem I met when running on a custom dataset:
Your paper says"Given a monocular RGBD input video, along with a segmentation mask of the object of interest in the first frame only, our method tracks the 6-DoF pose of the object through subsequent frames and reconstructs a textured 3D model of the object." Why it asks for the second masks?(My data start from 000010.png)