issues
search
natolambert
/
rlhf-book
Textbook on reinforcement learning from human feedback
https://rlhfbook.com/
MIT License
69
stars
7
forks
source link
Update workflows
#5
Closed
natolambert
closed
5 months ago