Closed GeorgiosSmyrnis closed 3 weeks ago
Due to the ordering of the loss resets, training would not actually exit when a NaN value was encountered. This PR fixes the ordering so that training exits as intended.
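To illustrate the kind of ordering bug described here, below is a minimal sketch. It is not the code from this PR: the `AverageMeter` class, the `train` function, and the `check_before_reset` flag are all hypothetical names, and the sketch assumes that the NaN check reads a tracked loss value that is also periodically reset.

```python
import math


class AverageMeter:
    """Hypothetical running-average tracker for the loss (not from the PR)."""

    def __init__(self):
        self.reset()

    def reset(self):
        self.sum = 0.0
        self.count = 0

    def update(self, value):
        self.sum += value
        self.count += 1

    @property
    def avg(self):
        # Returns 0.0 when nothing has been recorded since the last reset.
        return self.sum / self.count if self.count else 0.0


def train(losses, check_before_reset):
    """Return the step at which training stopped on NaN, or None if it never stopped."""
    meter = AverageMeter()
    for step, loss in enumerate(losses):
        meter.update(loss)
        if check_before_reset:
            # Assumed fixed ordering: inspect the loss, then reset it.
            if math.isnan(meter.avg):
                return step
            meter.reset()
        else:
            # Assumed buggy ordering: the reset wipes the NaN before the check.
            meter.reset()
            if math.isnan(meter.avg):
                return step
    return None


losses = [0.9, 0.7, float("nan"), 0.5]
buggy_stop = train(losses, check_before_reset=False)
fixed_stop = train(losses, check_before_reset=True)
print(buggy_stop, fixed_stop)
```

In this sketch the reset-first ordering never observes the NaN, so the loop runs to completion, while the check-first ordering stops at the step where the NaN appears.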