facebookresearch / vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
https://vissl.ai
MIT License
3.24k stars, 330 forks

First-class support for timm models #569

Open crypdick opened 1 year ago

crypdick commented 1 year ago

🚀 Feature

PyTorch Image Models (aka timm) is a popular computer vision library. If VISSL supported timm models, it would be easy to combine SOTA model architectures from timm with SOTA SSL methods.

Motivation & Examples

timm makes it easy to use hundreds of different model architectures, all with a consistent API. If timm models were supported, VISSL users could experiment with architectures not currently implemented in torchvision. For users who already use timm, it would reduce the friction of adopting VISSL.

One potential way to achieve this would be a reserved prefix for timm models:

MODEL:
  TRUNK:
    NAME: TIMM-seresnext26t_32x4d
    PRETRAINED: False
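
To illustrate how the reserved prefix might be resolved, here is a rough sketch, not a description of how VISSL works today. The helper names `parse_timm_name` and `build_timm_trunk` and the `TIMM_PREFIX` constant are hypothetical; the only real library call is `timm.create_model`, and wiring the result into VISSL's trunk registry is left out.

```python
from typing import Optional

# Hypothetical reserved prefix, taken from the example config above.
TIMM_PREFIX = "TIMM-"


def parse_timm_name(trunk_name: str) -> Optional[str]:
    """Return the timm architecture name if the prefix is present, else None."""
    if trunk_name.startswith(TIMM_PREFIX) and len(trunk_name) > len(TIMM_PREFIX):
        return trunk_name[len(TIMM_PREFIX):]
    return None


def build_timm_trunk(trunk_name: str, pretrained: bool = False):
    """Build a timm model for a prefixed trunk name (hypothetical helper)."""
    arch = parse_timm_name(trunk_name)
    if arch is None:
        raise ValueError(f"{trunk_name!r} does not use the {TIMM_PREFIX!r} prefix")
    # Imported lazily so that timm can remain an optional dependency.
    import timm

    # num_classes=0 asks timm to drop the classifier head and return pooled
    # features, which is what an SSL trunk would usually need.
    return timm.create_model(arch, pretrained=pretrained, num_classes=0)
```

With the config above, `build_timm_trunk("TIMM-seresnext26t_32x4d", pretrained=False)` would hand the architecture name `seresnext26t_32x4d` to timm, while trunk names without the prefix would fall through to VISSL's existing trunks.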