Closed: Xuehao-Gao closed this issue 1 year ago
Hi,
The 2D-to-3D motion prediction task was introduced by [1]. The authors formulated the problem so that future 3D poses are predicted in the camera coordinate frame of the current (last observed) frame.
Thus, the visualizations you see in Figures 4 & 5 are in the camera frame, while the visualization in Figure 1 is in the world frame (and that one is more of an illustration than a result). The camera parameters can be found in the original dataset files for both GTA-IM and PROX in case you need to do the 3D visualization in the world frame. Otherwise, the visualization script in our repo lets you visualize directly in the camera frame, producing the same figures as Figures 4 & 5.
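To make the camera-frame/world-frame distinction concrete, here is a minimal sketch of how predicted joints could be mapped into the world frame once the dataset's camera extrinsics are loaded. This is not code from the repo: the function name and the extrinsics convention (world-to-camera, i.e. `p_cam = R @ p_world + t`, which is common but should be checked against the GTA-IM/PROX files) are assumptions.

```python
import numpy as np

def camera_to_world(joints_cam, R, t):
    """Map 3D joints from camera coordinates to world coordinates.

    Assumes the extrinsics (R, t) map world -> camera,
    i.e. p_cam = R @ p_world + t, so the inverse is
    p_world = R.T @ (p_cam - t).

    joints_cam: (J, 3) array of joints in the camera frame.
    R: (3, 3) rotation matrix; t: (3,) translation vector.
    """
    # Row-vector form: (p - t) @ R is equivalent to R.T @ (p - t)
    return (joints_cam - t) @ R

# Toy example: identity rotation, camera 2 m "behind" the origin.
R = np.eye(3)
t = np.array([0.0, 0.0, 2.0])
cam = np.array([[0.0, 0.0, 2.0]])   # a point 2 m in front of the camera
world = camera_to_world(cam, R, t)  # -> [[0., 0., 0.]]
```

If the dataset instead stores camera-to-world extrinsics, the forward transform `joints_cam @ R.T + t` applies and no inversion is needed.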
[1] Cao, Zhe, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, and Jitendra Malik. "Long-term human motion prediction with scene context." In European Conference on Computer Vision, pp. 387-404. Springer, Cham, 2020.
Hi,
I saw that you predict the future 3D human poses in the current frame's camera coordinates. My question is: the predicted future 3D motion results shown in your paper appear to be in world coordinates, so an additional transformation (the future camera positions) is needed to convert between the two coordinate systems. In practice, this transformation matrix / camera position information is not available to your method. How, then, can your method predict future human motion in the 3D scene?