Closed xinsuinizhuan closed 4 years ago
It is not necessary but it is recommended to decrease the step size in gradient descent. (So when the neural network is getting close to the minima it decreases the step-size so as to not jump-over the minima).
A yolo's learning_rate, it decrease at epoche's 80% and 90%, whether it is necessary in our model?