Closed lmccalman closed 6 years ago
We need to scale the initial weights of each layer depending on the size of that layer's inputs and outputs. Initialization matters a lot!
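For illustration, here is a minimal sketch of one common way to do this kind of scaling, Glorot (Xavier) uniform initialization. The issue does not say which scheme is intended, so the choice of Glorot and the helper names `glorot_limit` and `glorot_uniform` are assumptions, not the project's actual API.

```python
import math
import random


def glorot_limit(fan_in, fan_out):
    """Half-width of the uniform range; shrinks as the layer gets wider."""
    return math.sqrt(6.0 / (fan_in + fan_out))


def glorot_uniform(fan_in, fan_out, rng=None):
    """Return a fan_in x fan_out weight matrix as a list of rows.

    Each weight is drawn from U(-limit, limit), which gives a variance of
    2 / (fan_in + fan_out). Hypothetical helper for illustration only.
    """
    rng = rng or random.Random()
    limit = glorot_limit(fan_in, fan_out)
    return [
        [rng.uniform(-limit, limit) for _ in range(fan_out)]
        for _ in range(fan_in)
    ]


# Example: weights for a layer with 256 inputs and 128 outputs.
weights = glorot_uniform(256, 128, random.Random(0))
```

The point of the sketch is only that the weight range depends on both `fan_in` and `fan_out`, so wide layers start with smaller weights than narrow ones.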