Closed melihcatal closed 10 months ago
CNN models (i.e., resnet18,50 etc.) work well. vision transformer models (i.e., swin_v2_s) train on a single GPU; however, DDP training fails.
CNN models (i.e., resnet18,50 etc.) work well. vision transformer models (i.e., swin_v2_s) train on a single GPU; however, DDP training fails.