Open xianyi11 opened 2 years ago
When using your code to train a model, I noticed "swa": true in the config file. What does "swa" mean?
SWA refers to stochastic weight averaging. The implementation is here, slightly modified from the PyTorch Lightning implementation.
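For anyone unfamiliar with the idea, here is a minimal sketch of the averaging step at the core of SWA: keep an equal-weight running mean of the weights seen at successive checkpoints and use that mean as the final model. This is not the code from this repo or from PyTorch Lightning; `RunningWeightAverage`, `update`, and `demo` are hypothetical names, and the weights are plain Python lists so the example runs without any dependencies. In real training you would more likely use something like PyTorch's `torch.optim.swa_utils.AveragedModel`.

```python
from typing import Dict, List

# Hypothetical stand-in for a model's state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


class RunningWeightAverage:
    """Equal-weight running mean of weight snapshots (illustrative, not the repo's code)."""

    def __init__(self) -> None:
        self.n_averaged = 0       # number of snapshots folded into the average so far
        self.avg: Weights = {}    # current averaged weights

    def update(self, weights: Weights) -> None:
        if self.n_averaged == 0:
            # First snapshot: the average is just a copy of the weights.
            self.avg = {name: list(values) for name, values in weights.items()}
        else:
            n = self.n_averaged
            for name, values in weights.items():
                # Incremental mean: avg_new = avg + (w - avg) / (n + 1)
                self.avg[name] = [
                    a + (w - a) / (n + 1) for a, w in zip(self.avg[name], values)
                ]
        self.n_averaged += 1


def demo() -> Weights:
    """Average three made-up snapshots, as if taken at the end of three epochs."""
    swa = RunningWeightAverage()
    snapshots = [{"w": [1.0, 2.0]}, {"w": [3.0, 4.0]}, {"w": [5.0, 6.0]}]
    for snapshot in snapshots:
        swa.update(snapshot)
    return swa.avg


if __name__ == "__main__":
    print(demo())
```

In practice the averaging usually starts only after some number of epochs, and batch-norm statistics are typically recomputed for the averaged weights afterwards; both are left out of this sketch.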