I have a question about the way you generate the 3D keypoints xyz coordinates from the expmaps. As shown in this line this, all keypoints are calculated by the keypoint coordinate from the first frame p3d[0, i, :], which will generate errornous GTs(unless there is only 1 frame or batch_size=1, but in your dataloader this is not the case)
I think this line should be changed to
p3d[:, i, :] = torch.matmul(R[:, parent[i], :, :], p3d[:, i, :].unsqueeze(-1)).squeeze() + p3d[:, parent[i], :]
Hi Wei,
I have a question about the way you generate the 3D keypoints xyz coordinates from the expmaps. As shown in this line this, all keypoints are calculated by the keypoint coordinate from the first frame
p3d[0, i, :]
, which will generate errornous GTs(unless there is only 1 frame or batch_size=1, but in your dataloader this is not the case)I think this line should be changed to
?
Or did I miss something? Best