RifleZhang / LLaVA-Reasoner-DPO

Unofficial Repo for LLaVA-Reasoner-DPO

This is an unofficial repo for the paper "Improve Vision Language Model Chain-of-thought Reasoning".

Release