OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation.
Figure 1: OmniPose framework for multi-person pose estimation. The input color image of dimensions (HxW) is fed through the improvedHRNet backbone and WASPv2 module to generate one heatmap per joint, or class.
We propose OmniPose, a multi-scale framework for multi-person pose estimation. The OmniPose architecture leverages multi-scale feature representations to increase the effectiveness of backbone feature extractors, with no significant increase in network size and no postprocessing.
The OmniPose framework incorporates contextual information across scales and joint localization with Gaussian heatmap modulation at the multi-scale feature extractor to estimate human pose with state-of-the-art accuracy.
The multi-scale representations allowed by the improved waterfall module in the OmniPose framework leverage the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations.
Our results on multiple datasets demonstrate that OmniPose, with an improved HRNet backbone and waterfall module, is a robust and efficient architecture for multi-person pose estimation with state-of-the-art results.
We propose the upgraded “Waterfall Atrous Spatial Pyramid” module, shown in Figure 2. WASPv2 is a novel architecture with Atrous Convolutions that is able to leverage both the larger Field-of-View of the Atrous Spatial Pyramid Pooling configuration and the reduced size of the cascade approach.
Figure 2: WASPv2 Module.
Figure 3: Pose estimation samples for OmniPose.
Link to the published article at ArXiv.
Datasets used in this paper and required for training, validation, and testing can be downloaded directly from the dataset websites below:
COCO Dataset: https://cocodataset.org/
MPII Dataset: http://human-pose.mpi-inf.mpg.de/
The pre-trained weights for OmniPose can be downloaded at here. The pre-trained weights for HRNet can be downloaded at here.
Bruno Artacho:
E-mail: bmartacho@mail.rit.edu
Website: https://www.brunoartacho.com
Andreas Savakis:
E-mail: andreas.savakis@rit.edu
Website: https://www.rit.edu/directory/axseec-andreas-savakis
Artacho, B.; Savakis, A. OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation. in ArXiv, 2021.
```
@InProceedings{Artacho_2021_ArXiv,
title = {OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation},
author = {Artacho, Bruno and Savakis, Andreas},
eprint={2103.10180},
archivePrefix={arXiv},
primaryClass={cs.CV},
year = {2021},
}
```
Artacho, B.; Savakis, A. UniPose+: A unified framework for 2D and 3D human pose estimation in images and videos. on IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021.
```
@article{Artacho_2021_PAMI,
title = {UniPose+: A unified framework for 2D and 3D human pose estimation in images and videos},
author = {Artacho, Bruno and Savakis, Andreas},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
year = {2021},
}
```
Artacho, B.; Savakis, A. UniPose: Unified Human Pose Estimation in Single Images and Videos. in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
```
@inproceedings{Artacho_2020_CVPR,
title = {UniPose: Unified Human Pose Estimation in Single Images and Videos},
author = {Artacho, Bruno and Savakis, Andreas},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2020}
}
```