Pretrained model with R(2+1)D backbone from Table 10

facebookresearch / AVID-CMA

Audio Visual Instance Discrimination with Cross-Modal Agreement

Thanks for releasing the code and pretrained models for your amazing work, "Audio-Visual Instance Discrimination with Cross-Modal Agreement". I noticed that you used different R(2+1)D architectures in different experiments, as shown in Table 9 and Table 10. Could you please release or share the Kinetics-pretrained model that uses the Table 10 architecture for the R(2+1)D backbone? I am working on self-supervised learning and want to include your paper in my current project. For comparison purposes, I want to use the same backbone as previous works.

Pretrained model with R(2+1)D backbone from Table 10 #6