f1yfisher / DriveDreamer2

103 stars 3 forks source link

Some question about training details #2

Open wjmsxbd opened 1 week ago

wjmsxbd commented 1 week ago

Thanks for your awesome work! In your paper, you mentioned that the data obtained was 12Hz, but during training, the input data was also 12Hz or 4Hz, because in the later implementation details, it was mentioned that the FVD was calculated at 4Hz. Looking forward to your reply!

f1yfisher commented 6 days ago

During model training, we employed a frame sampling operation to ensure the motion amplitude of the generated videos, ultimately using data at a frequency of 4 Hz for training. As demonstrated on our project homepage, the generated short videos are 2 seconds long, which corresponds to 8 frames at 4 Hz.