issues
search
vwxyzjn
/
lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
MIT License
145
stars
7
forks
source link
Use tensorflow-style adam
#3
Closed
vwxyzjn
closed
1 year ago