mit-han-lab / vila-u

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
MIT License
146 stars 3 forks source link