terkkila / cgml

Machine Learning with Computational Graphs
Apache License 2.0
2 stars 0 forks source link

Adaptive gradient descent #1

Closed terkkila closed 10 years ago

terkkila commented 10 years ago

Consider implementing Nesterov's accelerated gradient, momentum, and/or variance based adaptation of learning rates. This should make tuning of learning rate less critical to successful learning of model parameters.