Open sdbds opened 4 months ago
Code: https://github.com/kach/gradient-descent-the-ultimate-optimizer
Paper: https://arxiv.org/abs/1909.13371
This method requires modifying the optimizer itself, so it is less popular than the common optimizers. However, this repository already refactors many optimizers, so it may be feasible to implement here.
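
To illustrate why the optimizer itself has to change, here is a minimal sketch of the idea for plain SGD, where the learning rate is updated by its own gradient step on every iteration. As I understand it, the linked code obtains these hypergradients through automatic differentiation; this sketch instead uses the hand-derived rule for SGD and pure Python lists. The names `sgd_with_learned_lr`, `grad_fn`, and `kappa` are hypothetical and are not part of this repository or the linked one.

```python
def sgd_with_learned_lr(grad_fn, w, alpha=0.01, kappa=1e-4, steps=100):
    """SGD where the step size alpha is itself tuned by gradient descent.

    Hypothetical helper for illustration only.
    grad_fn(w) returns the gradient of the loss at w (a list of floats).
    kappa is the step size used to update alpha (an assumed name).
    """
    prev_grad = [0.0] * len(w)
    for _ in range(steps):
        g = grad_fn(w)
        # The previous update was w = w_prev - alpha * prev_grad, so
        # d(loss)/d(alpha) = -(g . prev_grad) by the chain rule.
        hyper_grad = -sum(gi * pi for gi, pi in zip(g, prev_grad))
        alpha -= kappa * hyper_grad
        # Ordinary SGD step using the freshly updated alpha.
        w = [wi - alpha * gi for wi, gi in zip(w, g)]
        prev_grad = g
    return w, alpha


# Usage on a toy quadratic loss f(w) = sum(w_i^2), whose gradient is 2 * w.
def quad_grad(w):
    return [2.0 * wi for wi in w]


w_final, alpha_final = sgd_with_learned_lr(quad_grad, [3.0, -2.0], steps=200)
```

The extra state (`prev_grad`) and the extra update to `alpha` are what would need to be added inside each optimizer's step, which is the part that makes this less of a drop-in change.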