Closed zazyzaya closed 3 years ago
In both training examples clip_grad_norm() is called after opt.step() so it isn't doing anything. It should either be removed or moved before the optimizer updates the model components
clip_grad_norm()
opt.step()
In both training examples
clip_grad_norm()
is called afteropt.step()
so it isn't doing anything. It should either be removed or moved before the optimizer updates the model components