msr-fiddle / pipedream

MIT License
379 stars 117 forks source link

how does the backward msg get the corresponding old weights in 1F1B-RR #22

Closed gbxu closed 4 years ago

gbxu commented 4 years ago

hi, me again.. First of all, your work is really good, so I read it once more time.


deepakn94 commented 4 years ago

The activations / gradients processed by PipeDream are explicitly controlled in the runtime: receive queues are read in a deterministic order, which means PipeDream is resistant to messages from stage replicas arriving in different orders. As you pointed out, lines 586 and 592 implement this logic.

gbxu commented 4 years ago

thx, deepak!