yonseivnl/vlm-rlaif
ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Apache License 2.0
52 stars
3 forks
issues
Trying Inference on TempCompass benchmark
#10
yogkul2000
opened
1 month ago
1
merge from original
#9
dcahn12
closed
3 months ago
0
Update README
#8
Yuuraa
closed
4 months ago
0
Upload RLAIF Training Script
#7
Yuuraa
closed
4 months ago
0
Look forward to your codes!
#6
yepzhang
closed
4 months ago
1
Add Evaluation Code
#5
Yuuraa
closed
4 months ago
0
Update Evaluation Code
#4
Yuuraa
closed
4 months ago
0
Add RLAIF training code
#3
Yuuraa
closed
5 months ago
0
AI-generated preference annotations may be noisy
#2
hlchen23
closed
4 months ago
2
Look forward to your code!
#1
ZiruiSongBest
closed
4 months ago
1