Closed Light-V closed 2 years ago
The main process with local_rank 0 will wait forever because other process will not call torch.distributed.barrier()
The main process with local_rank 0 will wait forever because other process will not call torch.distributed.barrier()