Feature: Define a set of default data formats for OpenRLHF to reduce the cost of using custom data for everyone.

OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

https://openrlhf.readthedocs.io/

Apache License 2.0

1.73k stars 164 forks source link

Closed catqaq closed 5 days ago

catqaq commented 6 days ago

hijkzzz commented 5 days ago

Feel free to reopen this issue if there are any questions.