Open Bahador-Bakhshi opened 3 years ago
Different approaches can be used to decay alpha and epsilon, for example alpha = alpha0 / (1 + iteration * decay)
Have a look at "Learning Rate Scheduling" in the "Hands On ..." book
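A minimal sketch of the inverse-time decay formula mentioned above, applied to both alpha and epsilon. The function name `inverse_time_decay` and the starting values (`alpha0`, `epsilon0`, `decay`) are illustrative assumptions, not taken from the repository's code:

```python
def inverse_time_decay(initial_value: float, iteration: int, decay: float) -> float:
    """Return initial_value / (1 + iteration * decay)."""
    return initial_value / (1.0 + iteration * decay)


# Hypothetical starting values, chosen only for illustration.
alpha0 = 0.5     # initial learning rate
epsilon0 = 1.0   # initial exploration rate
decay = 0.01     # decay coefficient shared by both schedules

for iteration in (0, 100, 1000):
    alpha = inverse_time_decay(alpha0, iteration, decay)
    epsilon = inverse_time_decay(epsilon0, iteration, decay)
    print(f"iteration={iteration} alpha={alpha:.4f} epsilon={epsilon:.4f}")
```

With this schedule the value halves once `iteration * decay` reaches 1, then keeps shrinking more slowly. Using separate decay coefficients for alpha and epsilon is an equally reasonable choice.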