Open Ha0Tang opened 1 year ago
The VideoMAE pre-trained models are trained on datasets of 16 frames per video. Can I fine-tine it on a video dataset of 32 frames?
The VideoMAE pre-trained models are trained on datasets of 16 frames per video. Can I fine-tine it on a video dataset of 32 frames?