issues
search
RERV
/
VDT
[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.
Other
211
stars
13
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
temporal_fc in VDTBlock
#14
zhang-haojie
opened
1 month ago
0
Question about the `VDT`
#13
zhang-haojie
closed
6 months ago
0
How to make text to video diffusion network?
#12
ersanliqiao
opened
8 months ago
1
GPU computer capability
#11
MeL0XIA
closed
6 months ago
0
文中的Mask机制,在代码中对不上
#10
Nutingnon
opened
8 months ago
2
Physion inference with less than 8 condition frames
#9
aweitz
opened
8 months ago
5
test
#8
gggxxx1234
closed
8 months ago
2
不吹不擂,分析一下VDT和Sora之间的差别,顺Genie继续往远眺望...
#7
yuedajiong
opened
9 months ago
0
Training Code and Dataset format?
#6
BingliangLi
opened
9 months ago
4
any diff with https://github.com/VDT-2023/VDT?
#5
nemonameless
closed
9 months ago
1
More Physion evaluation results
#4
DylanTao
closed
1 year ago
1
Cityscapes pretrained model
#3
JunyaoHu
closed
1 year ago
1
Some confusion about the code.
#2
jiangchaokang
closed
1 year ago
1
Would you release the training code?
#1
zen-d
closed
1 year ago
4