mengdi-li / awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Apache License 2.0
138 stars 4 forks source link