Open 299792459b opened 10 months ago
Red is training loss Orange is valid loss
Other datasets show same behavior
Training loss fluctuates up and down drastically. Anyone know why is this happening? Thank you
Try lowering learning rate.
which parameter in config is actually LR?
Red is training loss Orange is valid loss
Other datasets show same behavior
Training loss fluctuates up and down drastically. Anyone know why is this happening? Thank you