Closed lianglia-apollo closed 2 months ago
Hi. Please download all splits of Trainval in Full dataset (v1.0), including the sweeps
set.
To replicate our performance, we recommend also downloading OpenDV-YouTube dataset from DriveAGI if possible. You can start with a few YouTubers like J Utah. Feel free to let us know if you encounter any troubles.
Hi. Please download all splits of Trainval in Full dataset (v1.0), including the
sweeps
set. To replicate our performance, we recommend also downloading OpenDV-YouTube dataset from DriveAGI if possible. You can start with a few YouTubers like J Utah. Feel free to let us know if you encounter any troubles.
Hi, I also want to experiences the pretrained model to see the generated videos based on NuScenes. Howerver I enconter a problem seemingly from dataset.
Loading model from ckpts/vista.safetensors
Global seed set to 23
Traceback (most recent call last):
File "/data/workspace/code/Vista-main/sample.py", line 254, in
I only download the first two of all parts. Could you please provide a more detailed dataset organization structure after that I have downloaded all of the 10 parts of Nuscenes. (I note that all part files have directory'sample','sweep', should I manually merge them into the same directory?)
Hi @MulitiByte, thanks for your interest. Yes, I think you need to merge them all to fit the annotations.
Hi @MulitiByte, thanks for your interest. Yes, I think you need to merge them all to fit the annotations.
Thank you so much for your kind an patient reply. I have successfully tested on my own machine. And I have some more questions, I set the configurations for sampling as follows:
action='free', n_rounds=1, n_frames=16, n_conds=3, seed=46, height=320, width=576, cfg_scale=2.5, cond_aug=0.0, n_steps=50, rand_gen=True, low_vram=False
Some produced videos are a little unnatural, could you please provide some advice acorrding to your practical experiences to help to improve the quality? Thanks a lot ! (BTW: dose the parameter n_conds means using the first three frames as the condition for the whole 16 frames under this config?)
Hi @MulitiByte,
could you please provide some advice acorrding to your practical experiences to help to improve the quality?
I think the main factor is the number of frames you use. It is not recommended to set n_frames
less than 25 if you have enough GPU memory.
dose the parameter n_conds means using the first three frames as the condition for the whole 16 frames under this config?
Yes, you are right!
I'm closing this issue due to its inactivity. Free feel to reopen it if you have further questions.
First, appreciate for open source this project.
Could you elaborate which dataset is needed on https://www.nuscenes.org/download to reproduce this work?