openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences
https://openai.com/blog/fine-tuning-gpt-2/
MIT License
1.24k stars, 164 forks

error when training a reward model #5

Open Yuminzhou opened 4 years ago

Yuminzhou commented 4 years ago

[Image: screenshot of the error; contents not available]

I get the error shown above when training a reward model. Thanks.