ih4cku / blog

deprecated, Git issues are great for writing blogs :)

2 stars 0 forks source link

Open ih4cku opened 8 years ago

ih4cku commented 8 years ago

vanishing and exploding gradient / sensitivity

(must see) X. Glorot and Y. Bengio. Understanding the difficulty of trainingdeep feedforward neural networks. InAISTATS, 2010.
(must see) Pascanu, Razvan, Tomas Mikolov, and Yoshua Bengio. "On the difficulty of training recurrent neural networks." ICML (3) 28 (2013): 1310-1318.
Why are deep neural networks hard to train?

ih4cku commented 8 years ago

ih4cku commented 8 years ago

following works: