dairui01 / MS-TCT

[CVPR2022] MS-TCT
54 stars 11 forks source link

some problems about MS-TCT on charades dataset #3

Closed Wyz2927 closed 1 year ago

Wyz2927 commented 1 year ago

I have some questions about MS-TCT. I try to use the published code to train a MS-TCT model on charades dataset, but get a much lower map value than the paper reported. I only get 19.8750 map on charades dataset. I want to know if there exist some tricks when training the model? And the following is my training set.

image

image

and my training log is as follow: image it seems that the model is easy to be over-fitting.

dairui01 commented 1 year ago

There is no trick for training the model in the code. If all the hyperparameters are the same, then I think the issue lies in the low-quality I3D features. As the MS-TCT inputs the I3D features, thus the feature quality is critical. In our work, we follow exactly the same as this repository to extract the feature: https://github.com/piergiaj/pytorch-i3d

Note that: You can utilize directly their charades pre-trained model to extract the features. However, please do not change any terms relevant to the window size in the original code of I3D feature extraction. This change may hamper the final feature quality. Moreover, please utilize the RGB frames at 24 fps frame version rather than the provided 480p version from their website. Different video resolutions can also cause a difference in the final result.