sail-sg / mvp

NeurIPS-2021: Direct Multi-view Multi-person 3D Human Pose Estimation
Apache License 2.0

Single camera results #16

Closed xiaochehe closed 2 years ago

xiaochehe commented 2 years ago

I have a question about your work, "Direct Multi-view Multi-person 3D Pose Estimation" (NIPS 2021). Your multi-view performance on the Panoptic dataset is much better than VoxelPose's. However, why is it worse than VoxelPose in the single-view setting? Your MPJPE is 93.8mm while VoxelPose's MPJPE is 66.95mm. Also, I can't reproduce their results. Could you help me with this problem? Thanks.
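For readers unfamiliar with the metric quoted above: MPJPE (mean per-joint position error) is the average Euclidean distance between predicted and ground-truth 3D joints. The snippet below is only a minimal sketch of that definition, not the evaluation code of either repository. The function name `mpjpe` and the 15-joint example are assumptions for illustration, and it assumes each predicted pose has already been matched to its ground-truth person.

```python
import math


def mpjpe(pred, gt):
    """Mean per-joint position error between two matched poses.

    pred, gt: sequences of (x, y, z) joint positions in the same unit
    (e.g. millimetres). Hypothetical helper, not from the MVP codebase.
    """
    if len(pred) != len(gt) or len(pred) == 0:
        raise ValueError("pred and gt must be non-empty and the same length")
    # Average the per-joint Euclidean distances.
    return sum(math.dist(p, g) for p, g in zip(pred, gt)) / len(pred)


# Toy example: every joint is offset by (3, 4, 0) mm from the ground truth.
gt_pose = [(0.0, 0.0, 0.0)] * 15
pred_pose = [(3.0, 4.0, 0.0)] * 15
error_mm = mpjpe(pred_pose, gt_pose)
```

In a multi-person benchmark the reported number also depends on how predictions are assigned to ground-truth people, which this sketch leaves out.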

twangnh commented 2 years ago

Thanks for your interest in our work. The performance is lower in the single-view setting mainly because VoxelPose explicitly regresses the 3D coordinates with its voxel design. This introduces a strong architectural prior, which is advantageous when the depth ambiguity is severe (e.g., single view). That said, we have not deliberately tried to tune MVP for the single-view setting, so it may be possible to improve it.
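To illustrate the depth ambiguity mentioned above, here is a minimal sketch using a plain pinhole camera model. It is not code from MVP or VoxelPose; the function `project` and the intrinsics (`focal`, `cx`, `cy`) are made-up values for illustration. It shows that two 3D points on the same camera ray land on the same pixel, so a single view alone cannot tell them apart.

```python
def project(point, focal=1000.0, cx=960.0, cy=540.0):
    """Project a 3D point in camera coordinates to pixel coordinates.

    Ideal pinhole model with no distortion. The intrinsics are
    hypothetical defaults chosen only for this example.
    """
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera (z > 0)")
    return (focal * x / z + cx, focal * y / z + cy)


# Two candidate joint positions on the same ray from the camera centre:
# the second is the first scaled by 2.5, i.e. 2.5 times farther away.
near_joint = (0.2, -0.1, 2.0)
far_joint = tuple(2.5 * c for c in near_joint)

near_pixel = project(near_joint)
far_pixel = project(far_joint)
# Both pixels coincide (up to floating-point rounding), although the
# 3D points are metres apart.
```

A second calibrated view would generally project these two points to different pixels, which is why the ambiguity is most severe with a single camera.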

xiaochehe commented 2 years ago

https://github.com/karfly/learnable-triangulation-pytorch/issues/64#issue-588872398

For a VoxelPose-based method in a single-view setup, they train their model on 1–4 views simultaneously. Did you train in a similar way?

twangnh commented 2 years ago

We used only single-view training and testing for the single-view setup.

xiaochehe commented 2 years ago

Got it, thanks.