facebookresearch / r3m

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
https://sites.google.com/view/robot-r3m/
MIT License
292 stars 45 forks

Is there a ViT version of the R3M pretrained model, besides ResNet? #29

Open mingxiaohuo opened 1 year ago

mingxiaohuo commented 1 year ago

I want to process a video sequence with the R3M model, but I can only find ResNet pretrained models. If only ResNet models are available, how can the model handle a video sequence? What are the input and output?
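To make the question concrete, here is a minimal sketch of what I assume frame-by-frame use would look like if the ResNet model only encodes single images: each frame goes in separately, and a video of T frames comes out as T embeddings. Everything here is hypothetical: `encode_frame` is a toy stand-in (per-channel mean), not the r3m API, and `encode_video` and `make_frame` are names made up for illustration.

```python
from typing import Callable, List, Sequence

# Assumption: a frame is an H x W x 3 nested list of floats (toy format).
Frame = List[List[List[float]]]
Embedding = List[float]


def encode_frame(frame: Frame) -> Embedding:
    """Hypothetical stand-in for a pretrained per-image encoder.

    Returns the mean of each color channel, i.e. a 3-dim "embedding".
    A real encoder would return a much larger feature vector.
    """
    pixels = [px for row in frame for px in row]
    n = len(pixels)
    return [sum(px[c] for px in pixels) / n for c in range(3)]


def encode_video(
    frames: Sequence[Frame],
    encoder: Callable[[Frame], Embedding] = encode_frame,
) -> List[Embedding]:
    """Encode a video by applying the image encoder to each frame independently."""
    return [encoder(frame) for frame in frames]


def make_frame(value: float, height: int = 2, width: int = 2) -> Frame:
    """Build a constant-colored toy frame for the demo."""
    return [[[value, value, value] for _ in range(width)] for _ in range(height)]


# Toy video with 4 frames; the output has one embedding per frame.
video = [make_frame(float(t)) for t in range(4)]
embeddings = encode_video(video)
print(len(embeddings), len(embeddings[0]))
```

In this sketch the input is a sequence of frames and the output is a list of per-frame embeddings of the same length; any temporal modeling would have to happen downstream of the encoder. Is that the intended usage?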