xingyizhou / CenterNet2

Two-stage CenterNet
Apache License 2.0

hangs while calling comm.synchronize() #84

Closed by zjuPeco 1 year ago

zjuPeco commented 1 year ago

Training works fine on one GPU, but with two GPUs it hangs at dist.barrier(device_ids=[torch.cuda.current_device()]).

Any advice?

zjuPeco commented 1 year ago

Solved. It was a problem with my machine.