Open justheuristic opened 2 years ago
Why: We need memory-efficient vision transformers (both vanilla ViT and SWIN v2) for LAION projects. These models are also generic enough to spark future use.
Why: We need memory-efficient vision transformers (both vanilla ViT and SWIN v2) for LAION projects. These models are also generic enough to spark future use.