modelscope / ms-swift

Use PEFT or full-parameter training to fine-tune 300+ LLMs or 80+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
https://swift.readthedocs.io/zh-cn/latest/Instruction/index.html
Apache License 2.0
3.41k stars, 292 forks

Support RLAIF-V #981

Closed: choyakawa closed this issue 1 month ago

choyakawa commented 3 months ago

Describe the feature

RLAIF-V effectively reduces hallucination across different MLLMs.

Paste any useful information

https://github.com/RLHF-V/RLAIF-V/

hjh0119 commented 1 month ago

Supported.