Closed hadaev8 closed 4 years ago
I'm using NVIDIA Apex and torch grad norm. This is the grad norm plot with Ranger (red) and AdamW (blue): https://i.imgur.com/Ui4Sioo.png Is it OK to have such huge grad norm values? Should I turn off grad norming?
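For context on reading such a plot, here is a minimal sketch, assuming the plotted values are what `torch.nn.utils.clip_grad_norm_` returns. That function returns the total norm measured before clipping, so large values in the plot do not by themselves mean the clipped gradients are large. The tiny model, the random batch, the inflated loss, and `max_norm = 1.0` are all hypothetical and only there for illustration.

```python
import torch

torch.manual_seed(0)

# Hypothetical tiny model and batch, for illustration only.
model = torch.nn.Linear(4, 1)
inputs = torch.randn(8, 4)
targets = torch.randn(8, 1)

# Inflate the loss on purpose so the gradients come out large.
loss = 1000.0 * torch.nn.functional.mse_loss(model(inputs), targets)
loss.backward()

max_norm = 1.0  # assumed clipping threshold, not taken from the post

# clip_grad_norm_ clips in place and returns the total norm
# as it was BEFORE clipping.
norm_before = float(torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm))

# Recompute the total norm from the gradients that were just clipped.
per_param_norms = torch.stack([p.grad.norm(2) for p in model.parameters()])
norm_after = float(per_param_norms.norm(2))

print(f"value returned by clip_grad_norm_: {norm_before:.2f}")
print(f"norm of the clipped gradients:     {norm_after:.4f}")
```

With Apex AMP, the Apex documentation suggests clipping `amp.master_params(optimizer)` instead of `model.parameters()`, after the `with amp.scale_loss(...)` block has exited, so that the norm is computed on unscaled gradients. Whether that applies here depends on the opt level and setup, which the post does not state.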