dongyh20 / Insight-V

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
82 stars 3 forks source link