Open Lee-Siyoung opened 1 year ago
Hi @Lee-Siyoung
trainer = pl.Trainer(gpus = -1,
accelerator='ddp',
check_val_every_n_epoch=10,
# precision=16,
# auto_scale_batch_size='binsearch',
callbacks=[checkpoint_callback],
max_epochs = 1)
I hope your trainer code looks like this after trainer.fit(model)
you're getting
RuntimeError: No rendezvous handler for env://
Because you are on Windows.
accelerator='ddp' will not work on windows, you have to choose 'dp'.
I think it will work..
Try it and let me know.
Thankyou :)
Thank you for your answer. Can you tell me which file that code is in? I looked it up, but it wasn't there...😢
@Lee-Siyoung Can u share me ur git link code for the project so that I can better understand
@rohanpatankar926 rohanpatankar926I didn't create git separately because I only changed the yaml here in git yolov7-pose. When I tried using colab, I solved the above error. Do you know what to do if you want to do more than 17 key points? I know that this git is hard-coded with 17.
@rohanpatankar926 Thanks for your reply! In which file can we change the accelerator? I mean in Yolov7 project, where can I change the accelerator to run successfully in Windows?
I only use one gpu.
torch.cuda.is_available() is True
train command
python -m torch.distributed.launch --nproc_per_node 1 --master_port 9527 train.py --data data/coco_kpts_samw.yaml --cfg cfg/yolov7_samw.yaml --weights weights/yolov7-w6-person.pt --batch-size 128 --img 960 --kpt-label --sync-bn --device 0 --name yolov7-w6-pose --hyp data/hyp.pose.yaml
error photo
I don't know how to solve this error, please help me 😥