lucidrains / phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
MIT License
748 stars 79 forks

training data #39

Open 23Rj20 opened 7 months ago

23Rj20 commented 7 months ago

For how long, and on how many videos, should I train to get good results? I tried training it with just two 10-second videos, and the samples it saves are just noise 8200

snehasree-sony commented 5 months ago

As I understand from the paper, the MiT dataset is used for training C-ViViT. Which dataset is used to train Phenaki for text-to-video generation?