vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase
MIT License
152 stars 7 forks source link

Bug fix / refactor #14

Closed vwxyzjn closed 1 year ago

vwxyzjn commented 1 year ago
liutianlin0121 commented 1 year ago

Awesome, LGTM! Thanks.