Use PEFT or Full-parameter to finetune 300+ LLMs or 80+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
3.41k
stars
292
forks
source link
Support RLAIF-V #981
Closed
choyakawa closed 1 month ago
Describe the feature Please describe the feature requested here(请在这里描述需求)
RLAIF-V effectively reduce the hallucination of different MLLMs
Paste any useful information Paste any useful information, including papers, github links, etc.(请在这里描述其他有用的信息,比如相关的论文地址,github链接等)
https://github.com/RLHF-V/RLAIF-V/