IIC-SIG-MLsys / HDDT

Distrubuted DNN Training on Heterogeneous GPUs
0 stars 6 forks source link

Message Synchronization Protocol #35

Open derekwin opened 1 week ago

derekwin commented 1 week ago

rendezvous protocol : we use two-sided RDMA SEND for rendezvous messages, and we use one-sided RDMA WRITE for actual message transmission.