Open amthreeh opened 1 week ago
Why are you using only a single GPU? If you use DistributedDataParallel or DataParallel, does it slow down?
Due to limitations in laboratory computing resources at the time, I only used one GPU. It is feasible to use multiple GPUs during training.
Why are you using only a single GPU? If you use DistributedDataParallel or DataParallel, does it slow down?