Open linlion0311 opened 1 year ago
This problem occurs in torch\distributed\rendezvous.py
But in this code I can't find any '}' to solve the problem
I try to print (hostname, port, world_size, start_daemon, timeout), But it doesn't look like they have a problem.
Need someone to help, thanks.
system is Windows, should I train it on linux?
I train the model on Linux and it can work!
This problem occurs in torch\distributed\rendezvous.py
But in this code I can't find any '}' to solve the problem
I try to print (hostname, port, world_size, start_daemon, timeout), But it doesn't look like they have a problem.
Need someone to help, thanks.