mengdi-li / awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Apache License 2.0
124 stars 4 forks source link