Open mathDR opened 2 months ago
Hey, any update on it since the PR has been closed? :) thanks !
Apologies. I closed this because It was just easier to "begin again". I have a draft PR locally that I was going to push this week.
Awesome, good to know! Thx for your work 👌
The AdeMAMix optimizer is a simple modification of the Adam optimizer with a mixture of two EMAs to better take advantage of past gradients.
The paper has
optax
skeleton code which I could contribute if the maintainers deem this a good fit for the repo.