mengdi-li / awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Apache License 2.0
124 stars, 4 forks

Update the abstract and publish info of RLC paper #2

Closed: lafmdp closed this 3 months ago

lafmdp commented 3 months ago

[ICLR'24] Language Model Self-improvement by Reinforcement Learning Contemplation

mengdi-li commented 3 months ago

Thanks for the update!