ColorHandPose3D is a Convolutional Neural Network estimating 3D Hand Pose from a single RGB Image. See the project page for the dataset used and additional information.
The network ships with a minimal example, that performs a forward pass and shows the predictions.
You can compare your results to the content of the folder "results", which shows the predictions we get on our system.
Recommended system (tested):
Python packages used by the example provided and their recommended version:
In order to use the training and evaluation scripts you need download and preprocess the datasets.
Run
python3.5 create_binary_db.py
After unzipping the dataset run
cd ./data/stb/
matlab -nodesktop -nosplash -r "create_db"
We provide scripts to train HandSegNet and PoseNet on the Rendered Hand Pose Dataset (RHD). In case you want to retrain the networks on new data you can adapt the code provided to your needs.
The following steps guide you through training HandSegNet and PoseNet on the Rendered Hand Pose Dataset (RHD).
You should be able to obtain results that roughly match the following numbers we obtain with Tensorflow v1.3:
eval2d_gt_cropped.py yields:
Evaluation results:
Average mean EPE: 7.630 pixels
Average median EPE: 3.939 pixels
Area under curve: 0.771
eval2d.py yields:
Evaluation results:
Average mean EPE: 15.469 pixels
Average median EPE: 4.374 pixels
Area under curve: 0.715
Because training itself isn't a deterministic process results will differ between runs. Note that these results are not listed in the paper.
There are four scripts that evaluate different parts of the architecture:
This provides the possibility to reproduce results from the paper that are based on the RHD dataset.
This project is licensed under the terms of the GPL v2 license. By using the software, you are agreeing to the terms of the license agreement.
Please cite us in your publications if it helps your research:
@InProceedings{zb2017hand,
author = {Christian Zimmermann and Thomas Brox},
title = {Learning to Estimate 3D Hand Pose from Single RGB Images},
booktitle = "IEEE International Conference on Computer Vision (ICCV)",
year = {2017},
note = "https://arxiv.org/abs/1705.01389",
url = "https://lmb.informatik.uni-freiburg.de/projects/hand3d/"
}