una-dinosauria / 3d-pose-baseline

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.
MIT License

How to use this method combined with 2D detection on run time #171

Open ShuangjunLiu opened 4 years ago

ShuangjunLiu commented 4 years ago

Thanks for your interest in our research!

I checked the code and found that the SH 2D detections are pre-processed and saved as results. According to the paper, this approach estimates depth from 2D joints in camera coordinates, which are supposed to be in mm.
However, the SH output is in image space, in pixels. Inverse camera projection requires depth for exact recovery, and depth is not supposed to be available at runtime.
Could you clarify how you obtained the 2D joints (in mm, in camera coordinates) from image space? Did you use the ground-truth depth data?
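
To make the concern concrete, here is a minimal sketch of inverse pinhole projection. It is only an illustration of the ambiguity, not code from this repository: the function name `back_project` and the intrinsics (`fx`, `fy`, `cx`, `cy`) are hypothetical values chosen for the example. The same pixel maps to different camera-space positions depending on the assumed depth.

```python
def back_project(uv, depth_mm, fx, fy, cx, cy):
    """Invert the pinhole projection for one pixel at a given depth.

    Returns (X, Y, Z) in camera coordinates, in the same unit as depth_mm.
    """
    u, v = uv
    x = (u - cx) * depth_mm / fx
    y = (v - cy) * depth_mm / fy
    return (x, y, depth_mm)


# Hypothetical intrinsics, in pixels (assumed for illustration only).
fx = fy = 1000.0
cx = cy = 500.0

# One 2D detection in image space, in pixels.
pixel = (600.0, 500.0)

# The same pixel back-projected at two different assumed depths.
near = back_project(pixel, 2000.0, fx, fy, cx, cy)
far = back_project(pixel, 4000.0, fx, fy, cx, cy)

print(near)  # (200.0, 0.0, 2000.0)
print(far)   # (400.0, 0.0, 4000.0)
```

Under this model, one pixel of offset from the principal point corresponds to `depth_mm / fx` millimeters, so no single pixel-to-mm factor exists without knowing the depth.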

YimjuneLiu commented 1 year ago

Hello, were you able to solve this problem? My guess is that the author may have used a fixed conversion, i.e. how many millimeters one pixel is equal to?