Closed chenxd1996 closed 5 months ago
Hey! Can you share your training parameters, such as the number of epochs and the learning rate? I think the notebook doesn’t train for long by default, so that could be the issue.
Oh, I see! I've been using the default training parameters. That might be the issue then. Could you please advise on the optimal settings for the training parameters for better performance?
In the original work I trained for 50k iterations with a batch size of 4, so the model ends up seeing 200,000 samples, although I think you can get away with a lot less.
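For anyone adapting these numbers to their own setup, here is a rough sketch of the arithmetic. The helper names and the dataset size of 1,000 are made up for illustration and are not taken from the notebook; only the 50k iterations and batch size of 4 come from the comment above.

```python
def samples_seen(iterations: int, batch_size: int) -> int:
    """Total number of training samples the model processes."""
    return iterations * batch_size


def equivalent_epochs(total_samples: int, dataset_size: int) -> float:
    """How many passes over the dataset that many samples corresponds to."""
    return total_samples / dataset_size


# Numbers quoted in the thread: 50k iterations at batch size 4.
total = samples_seen(iterations=50_000, batch_size=4)
print(total)  # 200000

# Hypothetical dataset size, purely for illustration.
print(equivalent_epochs(total, dataset_size=1_000))  # 200.0
```

If your notebook is configured in epochs rather than iterations, comparing the sample count this way is a quick check on whether the default run is much shorter than the original training.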
Closing this; feel free to reopen it if you have more questions!
Hello,
I have been using the Training notebook provided in this repository to train my model, and I've encountered an issue where the performance of the trained model is significantly subpar.