3dpose / GnTCN

Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos
MIT License
96 stars 13 forks source link

How to apply to videos in the wild? #16

Open hellojialee opened 2 years ago

hellojialee commented 2 years ago

Hi, thanks for your great work. I have just entered into the 3D human pose. I admire your work. Your paper estimates Z/f using weak perspective model, which SMAP estimates Zw/f, in which Z is the original depth, and f and w are the focal length and the image width both in pixels. Which is better? I think your estimation is in the real space and theirs is in a normalization space.

[1] SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation

hellojialee commented 2 years ago

PS. Both of you aim to estimate the distance in meters.