princeton-vl / pose-hg-train

Training and experimentation code used for "Stacked Hourglass Networks for Human Pose Estimation"
Other
575 stars 185 forks source link

The difference between your pretrained model and that generated from your script? #24

Closed yq1011 closed 7 years ago

yq1011 commented 7 years ago

Hi!

Thanks for your shared code. I find that there is a little bit differences between your pretrained model and the model I generate from your code.

There is a 1GB cuda memory gap during training between them.

Could you tell me what could be the reason of that?

Thanks!

yq1011 commented 7 years ago

BTW, what's the valid set for your pretrained model? I set the validImgs as 1000 and the valid accuracy is so high.

I guess I should set the validImgs as 3000, just as your previous opt setting?

anewell commented 7 years ago

The pretrained model is the version used for the paper, but I have since added an extra residual module which introduces extra computation at a high resolution. That module is responsible for the memory difference you are seeing, but it should help performance.

As for the validation set, that was my bad for changing things up and having the code initialize a random validation split. I have just pushed an update that will instead choose the split used in our paper by default. But if you do want to initialize a random validation split, you can now use the flag '-randomValid'.

yq1011 commented 7 years ago

I see, thanks!