mengdi-li / awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Apache License 2.0
93 stars 4 forks source link

Add "RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness" #1

Closed dschaehi closed 1 month ago

dschaehi commented 1 month ago

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness https://arxiv.org/abs/2405.17220

mengdi-li commented 1 month ago

Thanks!