Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation [AAAI2024]

[Paper Page]

## Setup with Conda ```bash # create conda env conda create -n dir python=3.9 # install torch pip install torch==1.10.0+cu113 torchvision==0.11.1+cu113 --extra-index-url https://download.pytorch.org/whl/cu113 # install other requirements git clone --recursive https://github.com/ru1ven/KeypointFusion.git cd KeypointFusion pip install -r ./requirements.txt ``` ## Dataset preparation Download the [DexYCB dataset](https://dex-ycb.github.io/) and the [annotations](https://drive.google.com/drive/folders/1YAF1jAsGi2aWkTml1tFV2y39aSmIYpde?usp=sharing). ## Training & Evaluation Download our [pre-trained model](https://drive.google.com/file/d/1sl0r62C8c1eYlFKyFGk-CTW2hoXFvqIa/view?usp=sharing) on DexYCB s0. ```bash python train.py ``` you would get the following output: ```bash [mean_Error 6.927] [PA_mean_Error 4.790] ``` Comparison on HO3D can be seen in [here](https://codalab.lisn.upsaclay.fr/competitions/4318#results). ## Running in the wild We update a [demo](https://github.com/ru1ven/KeypointFusion/blob/main/demo_RGBD.py) for running our method in real-world scenes.

The results of KeypointFusion on in-the-wild images.

## BibTeX ``` @inproceedings{liu2024keypoint, title={Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation}, author={Liu, Xingyu and Ren, Pengfei and Gao, Yuanyuan and Wang, Jingyu and Sun, Haifeng and Qi, Qi and Zhuang, Zirui and Liao, Jianxin}, booktitle={Proceedings of the AAAI Conference on Artificial Intelligence}, volume={38}, number={4}, pages={3756--3764}, year={2024} } ```

ru1ven / KeypointFusion

readme

Keypoint Fusion for RGB-D Based 3D Hand Pose Estimation [AAAI2024]

[Paper Page]