对齐的框架和数据

baichuan-inc / Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

https://huggingface.co/baichuan-inc

Apache License 2.0

4.03k stars 286 forks source link

Open skepsun opened 10 months ago

skepsun commented 10 months ago

看了论文，baichuan2 chat版本做了rlhf流程，采集了类似于hh_rlhf的数据，请问有开源rlhf数据和训练框架的计划吗？或者可以先开源一部分reward model训练数据？

jieli4970 commented 10 months ago

附问一下，chat和base的差别就是加了对其这一步吗