mindspore-lab / mindrlhf

Apache License 2.0
26 stars 12 forks source link

update docs of reward model and rlhf dataset #35

Closed KerryKou closed 11 months ago

KerryKou commented 11 months ago

Update rlhf data processing scripts README doc.