Closed tangdong1994 closed 2 weeks ago
when i input eg: video = torch.randn(4,16,8,512,512) noisylatent, , , = get_vae_latent(video) noisy_latent[0][0].shape = 1,16,8,128,128 I am puzzled as to why the batch size I input is 4, but the resulting noisy latent batch size becomes 1.
Ah, I get it, it has been transformed into a list of length 4.
when i input eg: video = torch.randn(4,16,8,512,512) noisylatent, , , = get_vae_latent(video) noisy_latent[0][0].shape = 1,16,8,128,128 I am puzzled as to why the batch size I input is 4, but the resulting noisy latent batch size becomes 1.