Open woshizouguo opened 1 week ago
@wenxindongwork I suspect we will need to have our own prediction_step
method as we use our own datacollator instead of the default one, and the tests didn't catch this bug since the eval_steps
in the tests were > the max_steps
so it never ran the evaluation...
System Info
trl=0.11.2
Information
Tasks
examples
folderReproduction
for online dpo code
If I add
--eval_steps=5
and--eval_strategy=steps
, it shows error:Expected behavior
The eval is not using the correct input dataset.