THU-DA-6D-Pose-Group / GDR-Net

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. (CVPR 2021)
https://github.com/THU-DA-6D-Pose-Group/GDR-Net
Apache License 2.0
277 stars 47 forks source link

Ambiguities for Symmetric Objects #94

Closed pyni closed 1 year ago

pyni commented 2 years ago

Hi. After reading your codes. I have question about the ambiguities for symmetric objects.

It seems that the symmetry is only applied in Patch-P𝑛P network. But for Dense Correspondences Maps and Surface Region Attention Maps, they are regressed directly using equation (8) (L1 loss and CE loss). According to paper [1], this setting cannot handle symmetric objects since it penalizes pixels that have larger distances in the 3D space without any knowledge of the symmetry. It seems they should be:

image (Equation(2) in paper[1])

So why they(M_SRA and M_2D-3D) can implicitly represent the symmetry of an object in your paper? Thanks.

[1]Park K, Patten T, Vincze M. Pix2pose: Pixel-wise coordinate regression of objects for 6d pose estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019: 7668-7677.

wangg12 commented 1 year ago

Hi, because after optimization of CE loss, if the M_SRA has multiple peaks with similar probability, it is very likely there are some some ambiguities in the surface. This intuition is from the EPOS paper. You can also try the method in Pix2Pose for explicitly handling symmetry.

pyni commented 1 year ago

OK. Thanks for replying. I understand for M_SRA. But for M_2D-3D, maybe this problem still exists. Am I right?

wangg12 commented 1 year ago

yes.