Closed by limhasic 4 months ago
But what can I do with this?
Are there any examples of its use?
Hello! The main use case we focus on in the paper is training large multimodal models that reason about both image and text inputs. A few great examples of models that use this type of interleaved data are MM1 and Idefics2.
Thank you.