selalipop opened this issue 5 days ago (status: Open)
Yes, I have been facing the same issue. I am thinking of training in parts (n steps at a time), evaluating at the end of each part, and manually continuing training for more steps for as long as the eval and train loss values permit.
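For anyone wanting to try the same workaround, here is a rough sketch of the "train n steps, evaluate, decide whether to continue" loop. It is deliberately library-agnostic: `train_steps` and `evaluate` are hypothetical callables you would have to wire to your own trainer, and the `patience` stopping rule is my assumption, not something from this thread.

```python
from typing import Callable, List, Tuple


def train_in_chunks(
    train_steps: Callable[[int], float],  # hypothetical: trains n steps, returns train loss
    evaluate: Callable[[], float],        # hypothetical: runs eval, returns eval loss
    chunk_size: int,
    max_steps: int,
    patience: int = 2,
) -> List[Tuple[int, float, float]]:
    """Train in chunks and evaluate after each one.

    Stops early once the eval loss has failed to improve for
    `patience` consecutive chunks (an assumed stopping rule).
    Returns a list of (steps_done, train_loss, eval_loss) tuples.
    """
    history: List[Tuple[int, float, float]] = []
    best_eval = float("inf")
    chunks_without_improvement = 0
    steps_done = 0

    while steps_done < max_steps:
        # The last chunk may be shorter than chunk_size.
        n = min(chunk_size, max_steps - steps_done)
        train_loss = train_steps(n)
        steps_done += n

        eval_loss = evaluate()
        history.append((steps_done, train_loss, eval_loss))

        if eval_loss < best_eval:
            best_eval = eval_loss
            chunks_without_improvement = 0
        else:
            chunks_without_improvement += 1
            if chunks_without_improvement >= patience:
                break

    return history
```

Since the eval loss is computed and recorded by this loop itself, it does not depend on the trainer's own logging, which is the part that seems to be broken here.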
Will investigate this ASAP!
Evaluations are being run, but no validation loss is logged or sent to WandB
The console shows that eval is running, but displays a table along the lines of:
WandB shows evidence that the validation run occurs, but it doesn't display the loss either:
Very similar settings work when using the plain SFTTrainer in another project.
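A quick way to narrow this down might be to check what the evaluation step actually logs. The sketch below assumes the trainer derives from the Hugging Face `Trainer` and so exposes `trainer.state.log_history` as a list of dicts; the issue does not say which trainer class is in use, so treat that as an assumption. The helper name `summarize_eval_logs` is made up for illustration.

```python
from typing import Any, Dict, List


def summarize_eval_logs(log_history: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Summarize which evaluation metrics appear in a log history.

    Assumes each entry is a dict of logged values and that evaluation
    metrics use the "eval_" key prefix.
    """
    # Entries that contain at least one eval-prefixed metric.
    eval_entries = [
        entry for entry in log_history
        if any(key.startswith("eval_") for key in entry)
    ]
    # Of those, the entries that actually carry a loss value.
    with_loss = [entry for entry in eval_entries if "eval_loss" in entry]
    # Every distinct eval metric name seen, to show what is logged instead.
    eval_keys = sorted(
        {key for entry in eval_entries for key in entry if key.startswith("eval_")}
    )
    return {
        "eval_entries": len(eval_entries),
        "with_eval_loss": len(with_loss),
        "eval_keys": eval_keys,
    }
```

Calling `summarize_eval_logs(trainer.state.log_history)` after a run would show whether eval entries exist at all and whether `eval_loss` is among their keys. If `eval_entries` is non-zero but `with_eval_loss` is zero, the loss is being dropped before logging rather than somewhere on the WandB side.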