RERV / VDT

[ICLR2024] The official implementation of the paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

How to make a text-to-video diffusion network? #12

Open ersanliqiao opened 8 months ago

ersanliqiao commented 8 months ago

How can this network be modified to support text-to-video generation?

RERV commented 7 months ago

Hi, my apologies for the late reply. I think this repo will be helpful to you: https://github.com/maxin-cn/Latte