AkihikoWatanabe / paper_notes

たまに追加される論文メモ
https://AkihikoWatanabe.github.io/paper_notes
19 stars 0 forks source link

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action, Jiasen Lu+, N/A, arXiv'23 #1202

Open AkihikoWatanabe opened 10 months ago

AkihikoWatanabe commented 10 months ago

URL

AkihikoWatanabe commented 10 months ago

画像、テキスト、音声、アクションを理解できる初めてのautoregressive model。AllenAI