Ree1s / IDM

279 stars 20 forks source link

a bug of distributed #18

Open kangxi1818 opened 11 months ago

kangxi1818 commented 11 months ago

when i run this command: CUDA_VISIBLE_DEVICES=5 python -m torch.distributed.launch idm_main.py -p train -c config/ffhq_liifsr3_scaler_16_128.json -r /home/sx/sx_data/IDM-main/checkpoints/face/home/sx/sx_data/IDM-main/checkpoints/face this error occurs, how can i fix it? image and when i run this command: CUDA_VISIBLE_DEVICES=5 python -m torch.distributed.launch idm_main.py --local_rank=0 -p train -c config/ffhq_liifsr3_scaler_16_128.json -r /home/sx/sx_data/IDM-main/checkpoints/face/home/sx/sx_data/IDM-main /checkpoints/face this error occurs, image

Lee-zl commented 7 months ago

I meet the same problem, and I think this is because the command "python setup.py develop" do not run correctly.