Closed: memray closed this issue 5 years ago
This has been addressed before: you need to change the port on one of the tasks. Check here: https://github.com/OpenNMT/OpenNMT-py/blob/master/onmt/opts.py#L329
Cool! It solved my problem. Thanks!
Rui
@vince62s I am having the same problem. How did you fix it? Not sure how to change the port on one of the tasks. Please let me know! @memray
@vince62s Just as Vincent suggested, I set a different master_port for each experiment (say 10000 for your exp1 on GPUs 0,1 and 10001 for your exp2 on GPUs 2,3), and it works.
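For anyone landing here later, here is a rough sketch of that setup in Python. It only builds the environment and argument list for each run; it does not launch anything. The build_launch helper is hypothetical, and the train.py entry point and the -master_port spelling are assumptions based on the opts.py link above, so check them against your OpenNMT-py version. The ports and GPU splits are the example values from this thread.

```python
def build_launch(visible_devices, master_port):
    """Return (env overrides, argv) for one training run.

    Hypothetical helper: each run gets its own CUDA_VISIBLE_DEVICES and
    its own master port, so two runs on one node do not collide on the
    port used for distributed initialization.
    """
    env = {"CUDA_VISIBLE_DEVICES": visible_devices}
    argv = [
        "python", "train.py",            # assumed entry point
        "-world_size", "2",
        "-gpu_ranks", "0", "1",          # ranks are relative to the visible devices
        "-master_port", str(master_port),  # assumed flag name, must differ per run
    ]
    return env, argv


# Example values from the thread: exp1 on GPUs 0,1 and exp2 on GPUs 2,3.
exp1 = build_launch("0,1", 10000)
exp2 = build_launch("2,3", 10001)

for env, argv in (exp1, exp2):
    prefix = " ".join(f"{k}={v}" for k, v in env.items())
    print(prefix, " ".join(argv))
```

You would still need to append your own data and model options to each command. The only point is that the two runs share -world_size and -gpu_ranks but differ in CUDA_VISIBLE_DEVICES and in the master port.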
Hi,
I was trying to run two OpenNMT tasks on a single node at the same time. I set both tasks to -world_size 2 -gpu_ranks 0 1, each with a different CUDA_VISIBLE_DEVICES. I reinstalled PyTorch from source (ver=1.1.0a0+c3f5ba9) but the problem remains. The error is:
Thanks,
Rui