mks0601 / V2V-PoseNet_RELEASE

Official Torch7 implementation of "V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map", CVPR 2018
https://arxiv.org/abs/1711.07399
MIT License

Estimate 3D hand pose in ASL Finger Spelling Dataset #44

Closed by Bob130 5 years ago

Bob130 commented 5 years ago

Can I estimate 3D hand pose (uvd) on the ASL Finger Spelling Dataset using the pretrained models? Note that the resolution of its depth images is different, and no camera configuration is provided. Thanks in advance~

mks0601 commented 5 years ago

If the camera intrinsic parameters are not available, the depth images cannot be converted to a point cloud. If you are willing to assume an orthographic projection instead, then simply lifting the depth images to 3D and converting them to the voxel representation should work.
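
For illustration, here is a minimal Python sketch of the two options in the reply above. It is not code from this repository (which is written in Torch7), and every name in it is hypothetical: `backproject_perspective` shows why intrinsics (`fx`, `fy`, `cx`, `cy`) are needed for a true point cloud, while `lift_orthographic` and `voxelize` show one way an orthographic lift followed by voxelization might look. The `pixel_size`, `cube_size`, and `grid` values, the choice of the point mean as the cube center, and the convention that depth `<= 0` means background are all assumptions to adapt to the dataset at hand.

```python
import math


def backproject_perspective(u, v, d, fx, fy, cx, cy):
    """Pinhole back-projection of pixel (u, v) with depth d.

    Requires the camera intrinsics, which the dataset in question lacks.
    """
    return ((u - cx) * d / fx, (v - cy) * d / fy, float(d))


def lift_orthographic(depth, pixel_size):
    """Orthographic lift: x and y come straight from the pixel indices.

    pixel_size (world units per pixel) is an assumed value the user must
    choose. Pixels with depth <= 0 are treated as background and skipped.
    """
    return [(u * pixel_size, v * pixel_size, float(d))
            for v, row in enumerate(depth)
            for u, d in enumerate(row)
            if d > 0]


def voxelize(points, cube_size, grid):
    """Binary occupancy grid with `grid` cells per side.

    The grid covers a cube of edge `cube_size` centred on the mean of the
    points (an assumption; a dedicated centre estimate could be used instead).
    """
    vox = [[[0] * grid for _ in range(grid)] for _ in range(grid)]
    if not points:
        return vox
    n = len(points)
    center = [sum(p[i] for p in points) / n for i in range(3)]
    for p in points:
        idx = [math.floor((p[i] - center[i] + cube_size / 2) / cube_size * grid)
               for i in range(3)]
        # Points falling outside the cube are dropped.
        if all(0 <= k < grid for k in idx):
            vox[idx[0]][idx[1]][idx[2]] = 1
    return vox


# Toy 2x2 depth image (0 = background), lifted without any intrinsics.
toy_depth = [[0, 500],
             [500, 0]]
toy_points = lift_orthographic(toy_depth, pixel_size=1.0)
toy_voxels = voxelize(toy_points, cube_size=4.0, grid=4)
```

Because the orthographic lift ignores perspective scaling, the apparent hand size in the voxel grid depends on the chosen `pixel_size` and `cube_size`. Whether the pretrained models transfer well to such input is not addressed in this thread.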