Add "RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness"

mengdi-li / awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

Apache License 2.0

93 stars, 4 forks

Closed: dschaehi closed this issue 1 month ago

dschaehi commented 1 month ago

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness https://arxiv.org/abs/2405.17220

mengdi-li commented 1 month ago

Thanks!