vwxyzjn/lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
MIT License
152 stars
7 forks
Use `untrained_model` for normalize
#10
Closed
vwxyzjn closed 1 year ago
vwxyzjn commented 1 year ago
Closes #8