train at mpii,acc so small

yuanyuanli85 / Fast_Human_Pose_Estimation_Pytorch

Pytorch Code for CVPR2019 paper "Fast Human Pose Estimation" https://arxiv.org/abs/1811.05419

Apache License 2.0

325 stars 53 forks source link

train at mpii,acc so small #16

Open mathpopo opened 4 years ago

mathpopo commented 4 years ago

I try to run "python example/mpii.py -a hg --stacks 8 --blocks 1 --checkpoint checkpoint/hg_s8_b1/ " in your code , compare to your log.txt,loss can descend as same with you,but acc samll too mush,just "python example/mpii.py -a hg --stacks 8 --blocks 1 --checkpoint checkpoint/hg_s8_b1/ " is whole? log

yuanyuanli85 commented 4 years ago

looks like the train_loss does not drop too much. Have you follow the instruction to disable cudnn for batchnorm layer if you are using pytorch0.4.x?

mathpopo commented 4 years ago

i use titan xp ,can run well,this effect is in the v100,the two graphics cards are different?

mathpopo commented 4 years ago

disable cudnn for batchnorm layer: sed -i "1194s/torch.backends.cudnn.enabled/False/g"?i do this already,use pytorch 0.4.1,cuda 9.0

mathpopo commented 4 years ago

can you tell me your graphics cards hardware?

mathpopo commented 4 years ago

i use v100,resume from pre-trained model ,acc descend ,use titan xp , go up

yuanyuanli85 commented 4 years ago

The gpu I used is titan xp, not v100. I guess some low-level difference b/w driver/cudnn caused the problem in v100.

mathpopo commented 4 years ago

thank you very mush.sorry ,other issue,if half body & side ,effect so bad,i descend threshold ,point error happen test1

mathpopo commented 4 years ago

this is .0.1 threshold effect test2