Vision-CAIR / VisualGPT

VisualGPT (CVPR 2022 Proceedings): GPT as a decoder for vision-language models
MIT License
316 stars, 49 forks