issues
search
RERV
/
VDT
[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.
Other
194
stars
9
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Question about the `VDT`
#13
zhang-haojie
closed
1 month ago
0
How to make text to video diffusion network?
#12
ersanliqiao
opened
3 months ago
1
GPU computer capability
#11
MeL0XIA
closed
1 month ago
0
文中的Mask机制,在代码中对不上
#10
Nutingnon
opened
3 months ago
2
Physion inference with less than 8 condition frames
#9
aweitz
opened
3 months ago
4
test
#8
gggxxx1234
closed
3 months ago
2
不吹不擂,分析一下VDT和Sora之间的差别,顺Genie继续往远眺望...
#7
yuedajiong
opened
4 months ago
0
Training Code and Dataset format?
#6
BingliangLi
opened
4 months ago
4
any diff with https://github.com/VDT-2023/VDT?
#5
nemonameless
closed
4 months ago
1
More Physion evaluation results
#4
DylanTao
closed
8 months ago
1
Cityscapes pretrained model
#3
JunyaoHu
closed
8 months ago
1
Some confusion about the code.
#2
jiangchaokang
closed
8 months ago
1
Would you release the training code?
#1
zen-d
closed
8 months ago
3