A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
Add "RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness" #1
dschaehi closed 1 month ago
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness https://arxiv.org/abs/2405.17220