rohitgirdhar / AttentionalPoolingAction

Code/Model release for NIPS 2017 paper "Attentional Pooling for Action Recognition"
https://rohitgirdhar.github.io/AttentionalPoolingAction/
Other
258 stars 63 forks source link

About loss value after 12000 steps. #4

Closed YinRui1991 closed 6 years ago

YinRui1991 commented 6 years ago

@rohitgirdhar Thanks for your excellent code! I tested the trained models on the validation set, but the result of attentional model is: Mean AP: 0.0408322168145 Accuracy: 0.0651116199199; result of the pose attentional model is: Mean AP: 0.0203709144522 Accuracy: 0.0435031482541. These two models are got after 12000 training steps without pretrained models. For attentional model, the loss decreased from 32.7 to 27.2, and for pose attentional model, the loss decreased from 32.7 to 30.2. Can you share the final loss of these two models? Maybe I need train more steps to decrease the loss.

rohitgirdhar commented 6 years ago

Thanks for the feedback! These loss/mAP values indicate something is going wrong. The loss should start around 8~10 I think. Can you try running the pre-trained models I provide to see what performance you get with those?

YinRui1991 commented 6 years ago

@rohitgirdhar Thanks for your reply. I will download the pre-trained models and start training with them.

YinRui1991 commented 6 years ago

@rohitgirdhar Thanks for your advice, I run the pre-trained models and get the same performance showed in readme. Then I start training with pre-trained models and the loss value is about 2. I think my problem has been solved. Thanks again!

rohitgirdhar commented 6 years ago

Great, thanks for checking! However, you should be able to train directly from ImageNet models as well (as shown in training instructions). Make sure you are using the right initialization models.

ghost commented 5 years ago

Hi @rohitgirdhar . Thanks for the code and . Do you have some comparisons of the accuracies with other implementations ? In the paper, I could find mAP values compared. Do you think if accuracies can be improved further as I am focusing mainly on classification accuracy.

Thanks.