natolambert / rlhf-book

Textbook on reinforcement learning from human feedback
https://rlhfbook.com/
MIT License
69 stars 7 forks source link

Rejection Sampling Writing #11

Closed natolambert closed 2 months ago