Closed chjXu closed 4 days ago
In the dist_train process, it report the errors of _pickle.UnpicklingError: invalid load key, '<' and ERROR:torch.distributed.elastic.multiprocessing.api:failed. How to solve the problem? The environment is installed according to the README file.
In the dist_train process, it report the errors of _pickle.UnpicklingError: invalid load key, '<' and ERROR:torch.distributed.elastic.multiprocessing.api:failed. How to solve the problem? The environment is installed according to the README file.