I was wondering if someone could provide me with some details about the outputs from slahmr. I'm currently getting the outputs from the world_results.npz files, and am printing out all the outputs as well as their shapes. Here is what I'm getting.
I'm assuming 146 represents the number of frames in the video, but correct me if I'm wrong. I have a lot of assumptions based on reading the code and the paper of what these mean, but I just wanted to clarify . I am mostly curious of which outputs are positions versus orientations and which form if so.
I'm especially interested in the outputs hand_pose, trans, trans_vel, pose_body, root_orient, root_orient_vel, joints_vel, and latent_motion. Thanks a lot for the help!
Hello!
I was wondering if someone could provide me with some details about the outputs from slahmr. I'm currently getting the outputs from the world_results.npz files, and am printing out all the outputs as well as their shapes. Here is what I'm getting.
I'm assuming 146 represents the number of frames in the video, but correct me if I'm wrong. I have a lot of assumptions based on reading the code and the paper of what these mean, but I just wanted to clarify . I am mostly curious of which outputs are positions versus orientations and which form if so.
I'm especially interested in the outputs
hand_pose
,trans
,trans_vel
,pose_body
,root_orient
,root_orient_vel
,joints_vel
, andlatent_motion
. Thanks a lot for the help!