Closed zjersey closed 2 years ago
Support training and inference of Vision Transformer (ViT) and provide examples; remove the useless MoE kernel.