Open yamins81 opened 5 years ago
Current solution for correct weight decay on multi-GPUs is awkward unless you use regularizers. But what's the generic solution?
Current solution for correct weight decay on multi-GPUs is awkward unless you use regularizers. But what's the generic solution?