OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
https://openrlhf.readthedocs.io/
Apache License 2.0
1.71k stars 160 forks

Could support for SimPO be added? #311

Open victorShawFan opened 1 month ago

victorShawFan commented 1 month ago

SimPO paper: https://arxiv.org/html/2405.14734v1
Code: https://github.com/princeton-nlp/SimPO/tree/main

victorShawFan commented 2 weeks ago

I have already made the changes in my personal repository, and it now supports SimPO.

victorShawFan commented 2 weeks ago

May I contribute my modified version of OpenRLHF, which adds SimPO support, to this repo? I would like to contribute to open source.