ydhongHIT closed 5 years ago
For the first 500 steps, the learning rate is only one-tenth of the initial learning rate. After that, it becomes ten times larger, matching the initial value. Why?
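The schedule described here looks like a constant learning-rate warmup, a common practice in which training starts at a reduced rate so that large early updates do not destabilize the model. Whether that is what this repository intends is an assumption. A minimal sketch of such a schedule follows; the function name `constant_warmup_lr` and the `base_lr` value of 0.1 are hypothetical, while the 500 steps and the one-tenth factor come from the question.

```python
def constant_warmup_lr(step, base_lr=0.1, warmup_steps=500, warmup_factor=0.1):
    """Return the learning rate at `step` under a constant warmup.

    base_lr is a hypothetical value chosen for illustration; warmup_steps and
    warmup_factor mirror the 500 steps and one-tenth factor in the question.
    """
    if step < warmup_steps:
        # During warmup the rate is held at a fixed fraction of base_lr.
        return base_lr * warmup_factor
    # After warmup the rate jumps back to the full initial value.
    return base_lr


if __name__ == "__main__":
    for step in (0, 499, 500, 1000):
        print(step, constant_warmup_lr(step))
```

A linear warmup, which ramps the rate gradually from the reduced value up to `base_lr`, is a frequently used alternative that avoids the sudden tenfold jump at step 500.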