Closed Yancey1989 closed 4 months ago
To optimize distributed training graphs (DP, FSDP), DISC first needs to support collective ops.
1275
1288
1287