Closed liuhualin333 closed 4 years ago
Hi, thank you for your interest.
For Kinetics400, our train set has 232422 videos, val set has 19141 videos. It's also not a complete version. As long as your number of videos is not far from this, I think it won't affect much. BTW, we only use the train set for self-supervised training.
I just added an example in process_data/src/ for resizing videos to short size == 256.
Hi Tengda, thanks for your prompt reply!
I am currently downloading Kinetics-400 myself and the expected video number for training set is around 210000 only (considering the dataset is out for several years). I think it could have a potential huge impact on performance. It would be highly appreciated if you could kindly share a dropbox copy of your dataset by email for academic purposes🙏.
Thx for providing the cmd!
I have run your pre-training code and noticed that with multi-gpu setting your DataParallel nn module would gather multiple copies of loss and accuracies which would cause error in the main training procedure. The exact command to replicate is python main.py --gpu 0,1 --net resnet18 --dataset ucf101 --batch_size 16 --img_dim 128 --epochs 500
The line which throws the error is in main.py is listed here: 200 accuracy[i][0].update(top1.item(), B)
Hi,
For Kinetics dataset: Sorry, I don't think I can re-distribute the videos due to potential copyright issues, also Kinetics400 is too large to save in online storage. From my experience, 10% fewer videos will not have a huge impact on performance. If you really want to match the number of videos, one possible solution is to download some videos (from the same class) from Kinetics600 or Kinetics700. e.g. 300+ action classes from K400 are included in K700.
For the error: Thanks for pointing out. I corrected it just now. Hope it works well.
Thanks for the advice. Close the issue now.
Hi Tengda,
Thanks for sharing your work! I have some questions regarding the Kinetics-400.
Looking forward to hearing from you!
Best Regards, Hualin