openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences
https://openai.com/blog/fine-tuning-gpt-2/
MIT License
1.24k stars 164 forks source link