taoyang1122 / adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
Apache License 2.0
276 stars 21 forks source link

Can't reproduce Diving48 training #5

Closed ahnGeo closed 1 year ago

ahnGeo commented 1 year ago

Hello!

I want to reproduce your model training on Diving48, but failed. I used your diving48 config file, vitclip_base_diving48.py, with (1) original ver (2) clip len = 8, frame interval = 8, and command bash tools/dist_train.sh <PATH/TO/CONFIG> <NUM_GPU> --test-best --validate --cfg-options work_dir=<PATH/TO/OUTPUT>.

I wonder what is the problem. Please let me know. Thank you.

taoyang1122 commented 1 year ago

Hi @ahnGeo , thanks for your interest in our work. The results in our paper is based on 32 frames. I am not sure about the performance of 8 frames, but the performance seems to be a little low in your case. 1. Please make sure you changed all clip_len=8, including train/val/test and model num_frames. 2. It seems your batchsize is small. Our defaults setting is 8GPU with batchsize=64. If you use different number of GPUs and batchsize, you may need to tune the learning rate to get the best performance. Hope it helps.