OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
https://arxiv.org/abs/2405.11143
Apache License 2.0
1.69k stars, 154 forks