Open mmmeiko7 opened 3 years ago
Why does the learning rate in the log start at 1.6e-10, then grow to the lr we set (0.0261) and stay constant after that?
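To make the pattern I'm seeing concrete: the logged values look like what a linear warmup followed by a constant rate would produce. Below is a minimal sketch of that shape, not the scheduler this repo actually uses. The function name `warmup_then_constant` and the warmup length of 1000 steps are assumptions for illustration only; the start value 1.6e-10 and the target 0.0261 are taken from my log and config.

```python
def warmup_then_constant(step, base_lr=0.0261, warmup_steps=1000, start_lr=1.6e-10):
    """Hypothetical schedule: ramp linearly from start_lr to base_lr, then hold base_lr.

    warmup_steps=1000 is an assumed value, not one read from the repo's config.
    """
    if step >= warmup_steps:
        # After warmup the rate stays at the configured value.
        return base_lr
    # Fraction of the warmup completed so far, in [0, 1).
    frac = step / warmup_steps
    return start_lr + frac * (base_lr - start_lr)


if __name__ == "__main__":
    # Print the rate at a few steps to show the ramp and the plateau.
    for step in (0, 250, 500, 1000, 5000):
        print(step, warmup_then_constant(step))
```

If the real scheduler works like this, the tiny first value would simply be the start of the ramp rather than a bug.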