princeton-vl / pytorch_stacked_hourglass

Pytorch implementation of the ECCV 2016 paper "Stacked Hourglass Networks for Human Pose Estimation"
BSD 3-Clause "New" or "Revised" License

evaluation #13

Closed zenghy96 closed 4 years ago

zenghy96 commented 4 years ago

To generate heatmaps at low resolution, the ground-truth keypoints are transformed (via an affine transformation) into heatmap coordinates (Pts). But if we transform the Pts back to the original resolution, there are deviations (about 5-8 pixels) between the ground truth and the Pts. The network output is learned from the Pts, which already differ from the ground truth, so does this influence the precision of the final result? Waiting for your reply! THX!
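
To illustrate what I mean, here is a rough sketch (not the repo's actual transform code, and the crop geometry is a simplified assumption) of mapping a keypoint into 64x64 heatmap coordinates, quantizing it to a pixel, and mapping it back:

```python
import numpy as np

def round_trip_error(pt, center, scale, res=64):
    # crude stand-in for the crop/affine transform: a square crop of side
    # scale*200 centred at `center`, resized to a res x res heatmap
    side = scale * 200.0
    ul = np.asarray(center, dtype=float) - side / 2           # crop upper-left corner
    pt_hm = (np.asarray(pt, dtype=float) - ul) * res / side   # into heatmap coords
    pt_hm_q = np.round(pt_hm)                                  # peak lands on an integer pixel
    pt_back = pt_hm_q * side / res + ul                        # back to original coords
    return np.linalg.norm(pt_back - np.asarray(pt, dtype=float))

# a person occupying ~600 px in the original image (scale = 3.0):
# quantization to a 64x64 grid alone costs a few pixels
print(round_trip_error(pt=[412.3, 317.8], center=[400.0, 300.0], scale=3.0))
```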

crockwell commented 4 years ago

I believe you are essentially correct: the model is trained on heatmaps (generated from ground truth) at lower resolution, and this probably causes a very slight loss of keypoint prediction precision. The hourglass structure works at this resolution for input and output as well, and could be modified to use higher resolution at the cost of (a lot of) memory. Considering the model is pretty memory intensive, and adding more "stacks" helps results, I think the small sacrifice in precision seemed justified given other concerns and the dataset.
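
For reference, the training target is roughly a Gaussian placed at the (rounded) heatmap-space keypoint at the 64x64 output resolution; a minimal sketch below (the repo's own heatmap generation may differ in sigma and other details):

```python
import numpy as np

def make_target_heatmap(pt_hm, res=64, sigma=1.0):
    # dense Gaussian centred on the rounded heatmap-space keypoint
    xs, ys = np.meshgrid(np.arange(res), np.arange(res))
    x, y = round(pt_hm[0]), round(pt_hm[1])
    return np.exp(-((xs - x) ** 2 + (ys - y) ** 2) / (2 * sigma ** 2)).astype(np.float32)

target = make_target_heatmap((33.3, 33.9))   # one 64x64 training target channel
print(target.shape, target.argmax())
```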

I personally did not experiment with higher resolution heatmap training, but it is possible this could make a small difference in results. Ground truths should not be biased due to this downsampling however, so I would assume the possible gains are pretty small, at least in the MPII setting. I'm curious to hear what sort of results you could get running this ablation!