First, I executed the code according to the process in the repo and got roleembd.pth and ckpt.
However, the loss during training looks a bit strange. It changes very little for most of the run, then increases sharply in the last stage and drops to 0.
Second, in the infer.py stage, the outputs tensor is all 1s.
So how should I solve this problem? Should I retrain the model?