Thanks for the good work! I wonder why the 2D inputs are no normalized to the hip joint while the 3D outputs are.
If I understand correctly, there is a postprocess_3d function but not a 2D counterpart. Without normalizing/post-processing the 2D inputs, the network is not translational invariant for 3D pose estimation.
Thanks for the good work! I wonder why the 2D inputs are no normalized to the hip joint while the 3D outputs are.
If I understand correctly, there is a postprocess_3d function but not a 2D counterpart. Without normalizing/post-processing the 2D inputs, the network is not translational invariant for 3D pose estimation.