Open garrett361 opened 6 months ago
Cross-posting this issue from ipex, in case the torch-ccl team is not aware of it.
ipex
torch-ccl
Key issues:
The pytorch profiler traces highlight the issues (copied from the other thread):
Non-blocking kernel launch and comms/compute overlap.
Blocking kernel launch and no comms/compute overlap.
See the other thread for more details.
Cross-posting this issue from
ipex
, in case thetorch-ccl
team is not aware of it.Key issues:
The pytorch profiler traces highlight the issues (copied from the other thread):
A100 Trace
Non-blocking kernel launch and comms/compute overlap.
Intel Max 1550 Trace
Blocking kernel launch and no comms/compute overlap.
See the other thread for more details.