facebookresearch / mvit

Code Release for MViTv2 on Image Recognition.
Apache License 2.0

Pretrained ImageNet-21k weights for initializing MViT for video training #8

Open vateye opened 2 years ago

vateye commented 2 years ago

Hi, I noticed that the "adaptive kv stride" in the configuration for the pretrained ImageNet weights is "4, 4". According to the paper, however, the video version of MViTv2 uses an "adaptive kv stride" of "1, 8, 8", so the ImageNet weights cannot be used directly to initialize video training. Would you mind sharing the weights used to initialize MViT for video training?
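To illustrate the mismatch, here is a minimal sketch of how I understand the adaptive setting. It assumes the K/V pooling stride is set for the first stage and then divided by the query stride wherever the queries are pooled, with a floor of 1. The function name `expand_adaptive_kv_stride` and the 4-block layouts are hypothetical and not taken from this repository.

```python
from typing import List, Sequence


def expand_adaptive_kv_stride(
    initial_stride: Sequence[int],
    q_strides_per_block: Sequence[Sequence[int]],
) -> List[List[int]]:
    """Return one K/V pooling stride per block.

    Assumption (not confirmed by the repo): whenever a block pools the
    queries (non-empty q stride), the K/V stride is divided by that
    q stride along each axis, never dropping below 1.
    """
    current = list(initial_stride)
    schedule = []
    for q_stride in q_strides_per_block:
        if q_stride:
            if len(q_stride) != len(current):
                raise ValueError("q stride and kv stride must have the same rank")
            current = [max(kv // q, 1) for kv, q in zip(current, q_stride)]
        schedule.append(list(current))
    return schedule


# Hypothetical 4-block layouts; an empty list means "no query pooling here".
image_schedule = expand_adaptive_kv_stride(
    [4, 4], [[], [2, 2], [2, 2], [2, 2]]
)
video_schedule = expand_adaptive_kv_stride(
    [1, 8, 8], [[], [1, 2, 2], [1, 2, 2], [1, 2, 2]]
)

print(image_schedule)  # 2-D strides (height, width)
print(video_schedule)  # 3-D strides (time, height, width)
```

Under these assumptions, the image schedule has two entries per block and the video schedule has three, with different spatial values as well, which is why the two configurations do not line up.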