Mrpatekful / swats

Unofficial implementation of Switching from Adam to SGD optimization in PyTorch.
MIT License
65 stars 18 forks source link