Open psuwannapich opened 1 year ago
Hello, I am having the same issue. I am trying to finetune it using a German language dataset and it is giving me almost exactly the same picture that you got.
Were you able to find a solution? (or explanation)
Thank you.
I think this come from the model loss function from e2e-coref
If you get the loss curve like I did, it's fine since the model already converged. (According to my curve, it start to converge after step 7,000) But if the loss remain 0 after the training is conclude, I suggest to change your hyperparameters (or may be using new seed) and train your model again several times.
By the way, the best way to fix these is to fix the loss function to cover true negative case.
Hello, I try to finetune your model in another language but the loss of model both train and val set are 0 for first few epoches.
The loss curve
Is this normal or I do anything wrong?