natolambert / rlhf-book

Textbook on reinforcement learning from human feedback
https://rlhfbook.com/
MIT License
21 stars 2 forks source link

Add basic bib #3

Closed natolambert closed 3 months ago

natolambert commented 3 months ago

Three TODOs