Rejection Sampling + Run Again - Githubissues

natolambert / rlhf-book

Textbook on reinforcement learning from human feedback

https://rlhfbook.com/

MIT License

69 stars 7 forks source link

Rejection Sampling + Run Again #7

Closed natolambert closed 2 months ago