RERV / VDT

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.
Other
194 stars 9 forks source link

How to make text to video diffusion network? #12

Open ersanliqiao opened 3 months ago

ersanliqiao commented 3 months ago

怎么改造这个网络,可以实现文生视频呢?

RERV commented 1 month ago

Hi, my apologies for the late reply. I think this repo will be helpful to you (https://github.com/maxin-cn/Latte)