OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
https://openrlhf.readthedocs.io/
Apache License 2.0
1.73k stars 164 forks source link

Feature: Define a set of default data formats for OpenRLHF to reduce the cost of using custom data for everyone. #344

Closed catqaq closed 5 days ago

catqaq commented 6 days ago

related: https://github.com/OpenLLMAI/OpenRLHF/issues/259

hijkzzz commented 5 days ago

this was already implemented by tokenizer.apply_chat_template See https://openrlhf.readthedocs.io/en/latest/quick_start.html#prepare-datasets

Feel free to reopen this issue if there are any questions.