Open proshanm opened 4 months ago
You've chosen to report an unexpected problem or bug. Unless you already know the root cause of it, please include details about it by filling the issue template. The following information is missing: "Instructions To Reproduce the Issue and Full Logs"; "Your Environment";
Hi, Please Check: How to fix a SIGSEGV in pytorch when using distributed training (e.g. DDP)? Process 3 terminated with signal SIGSEGV
Thank You
Instructions To Reproduce the Issue:
to speedup training, I add torch.compile operation after DistributedDataParallel in detectron2/engine/defaults.py:
And I trained ViTDet model, it terminated abnormally with following exception message:
Environment:
I installed detectron2 in ubuntu with pytorch 2.1