Closed CYYAI closed 6 months ago
Has the training diverged? You can debug to see if there are any NaN values in the model's output layer?
Using the model you gave me, and then infer directly
The checkpoint weights did not load correctly.