RifleZhang / LLaVA-Reasoner-DPO

29 stars 1 forks source link

readme

Unofficial Repo for LLaVA-Reasoner-DPO

This is an unofficial repo for the paper: Improve Vision Language Model Chain-of-thought Reasoning

Release

[10.22] we will provide third party implementation for arxiv paper