Closed mannatsingh closed 3 years ago
Summary: The loss being NaN / inf during training is problematic since we cannot backprop using an invalid loss. This isn't the case during an eval phase and we don't need to crash.
Differential Revision: D28204666
This pull request was exported from Phabricator. Differential Revision: D28204666
Summary: The loss being NaN / inf during training is problematic since we cannot backprop using an invalid loss. This isn't the case during an eval phase and we don't need to crash.
Differential Revision: D28204666