vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase
MIT License
145 stars 7 forks source link

Pass `eps` to adam optimizer and correct minor typos #23

Closed liutianlin0121 closed 11 months ago

liutianlin0121 commented 12 months ago