Closed mlw214 closed 3 years ago
@mlw214 Hi Miller! Thanks for the PR! I realized the way I had things setup didn't make it easy for people to change the survival probability on a schedule, so I rewrote a bunch of code. Thank you for bringing this to my attention anyways!
Roger that! The changes you added in addition to this are great! Looking forward to playing around with this more
This PR adds support for stochastic depth, which is used in the paper for the vision experiments. I went ahead an added it to
gMLP
as well for completeness.I tried my best to match your style. Let me know if there are any problems, or if you want me to refactor anything.