vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase
MIT License
145 stars 7 forks source link

Use tensorflow-style adam #3

Closed vwxyzjn closed 1 year ago