Closed Omarito2412 closed 4 years ago
Techniques mentioned during the discussion:
1- Learning Rate Finder https://www.pyimagesearch.com/2019/08/05/keras-learning-rate-finder/
2- Warm-ups and restarts https://ruder.io/deep-learning-optimization-2017/index.html#warmrestarts https://openreview.net/forum?id=r14EOsCqKX
3- Curriculum Learning for NLP https://www.aclweb.org/anthology/N19-1119.pdf
Please join us in our discussion of "An overview of gradient descent optimization algorithms" on Saturday at 9:00 pm GMT+2. Blog post: https://ruder.io/optimizing-gradient-descent/index.html