Closed Bismuth209 closed 9 months ago
I also encountered this problem during the initial training. Could you tell me your detailed training config?
It's not much different from what you used.
guoqin@stu.pku.edu.cn
I have a couple of questions
How do you verify if the first stage has passed?
Test the results on the validation set.
can you show sample results that you received? I have shared what I was getting at the top of the comments
I use the latest training code, but the loss does not decrease
After training stage-1 for 30000 steps on TikTok dataset I'm getting the following loss curve and images from
validation_pipeline
is this correct?