Open tuqingwen opened 2 months ago
Please provide a clear and concise description of what the question is.
大佬,请问您新增的reward_modeling.py这一脚本是不是也可以用来训练评分器!数据集的形式就和data/reward一样把
可以。
Describe the Question
Please provide a clear and concise description of what the question is.
大佬,请问您新增的reward_modeling.py这一脚本是不是也可以用来训练评分器!数据集的形式就和data/reward一样把