una-dinosauria / 3d-pose-baseline

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.
MIT License
1.41k stars 355 forks source link

Accuracy of 3D coordinates on own images without tuning the model #173

Closed kaurrachneet6 closed 4 years ago

kaurrachneet6 commented 4 years ago

If I have computed the 2D coordinates in pixels using my own camera and then used your code to compute 3D coordinates in cm (in camera coordinates) without tuning the model, i.e. I used your code directly to infer 3D points (because in my data, actual 3D coordinates are unknown and hence, I cannot tune the model). Would this method still give me correct 3D coordinates in cm? Or will it not, because my camera is different than what was used for training the model. Thanks.

una-dinosauria commented 4 years ago

Hi @kaurrachneet6,

Please note that, in general, it is impossible for any 2d->3d model to infer the scale of the world from an arbitrary image (this follows from projective geometry). Therefore, many computer vision applications (ie SFM) produce reconstructions up to scale. This is the reason why we evaluate our model using Procrustes alignment, which accounts for global translation, rotation, and scale.

Cheers,