Closed jiashenggu closed 3 months ago
Hi, great work! I see you will release training code. How about reward model dataset or reward model?
加一
Thank you for your interest. We have released the fine-tuning code, fine-tuning prompts, and the checkpoint of reward models
刚刚评论就发现已经released wow~ ⊙o⊙
Hi, great work! I see you will release training code. How about reward model dataset or reward model?