issues
search
natolambert
/
rlhf-book
Textbook on reinforcement learning from human feedback
https://rlhfbook.com/
MIT License
69
stars
7
forks
source link
Rejection Sampling Writing
#11
Closed
natolambert
closed
2 months ago