Open SekeunKim opened 1 week ago
In this paper, it looks it can generate next token and decode to image for video generation. In demo, it only has generating image based on text. Is that correct ?
Thank you for great paper.
In this paper, it looks it can generate next token and decode to image for video generation. In demo, it only has generating image based on text. Is that correct ?
Thank you for great paper.