Code question - Githubissues

Hi, thanks for your interest in our project.

Eq. 12 means that in some regions of the image the pixels come from the warped image, while in other regions they are generated directly by the diffusion model. In latent space, this means that in some regions the features come from temp_cond_latents, while in the remaining regions they are generated by the diffusion model. The variable λ is used to compute the mask top_masks, which marks the regions where temp_cond_latents is used. See https://github.com/ZHU-Zhiyu/NVS_Solver/blob/e8433d60a01eccf7cd967281ec42911aa843c4f2/src/diffusers/schedulers/scheduling_euler_discrete.py#L930-L958
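To make the idea concrete, here is a minimal NumPy sketch of region-wise blending with a mask. It is not the code from the linked scheduler: only the names `temp_cond_latents` and `top_masks` come from the explanation above. The helper `make_top_mask`, the `score` array, and the rule "keep the fraction λ of positions with the highest score" are assumptions made purely for illustration; the repository may derive the mask from λ differently.

```python
import numpy as np

def make_top_mask(score, lam):
    """Hypothetical rule (an assumption, not the repo's logic):
    mark the fraction `lam` of positions with the highest score."""
    k = int(round(lam * score.size))
    flat_mask = np.zeros(score.size, dtype=bool)
    if k > 0:
        top_idx = np.argsort(score, axis=None)[-k:]  # indices of the k largest scores
        flat_mask[top_idx] = True
    return flat_mask.reshape(score.shape)

def blend_latents(generated_latents, temp_cond_latents, top_masks):
    """Use temp_cond_latents where the mask is True, the generated latents elsewhere."""
    return np.where(top_masks, temp_cond_latents, generated_latents)

# Toy 2x2 "latent" maps; real latents would also carry batch and channel dimensions.
score = np.array([[0.1, 0.9],
                  [0.5, 0.3]])               # hypothetical per-position confidence
generated_latents = np.zeros((2, 2))         # stands in for the diffusion model output
temp_cond_latents = np.ones((2, 2))          # stands in for the warped-image latents

top_masks = make_top_mask(score, lam=0.5)
blended = blend_latents(generated_latents, temp_cond_latents, top_masks)
print(top_masks)
print(blended)
```

With a spatial mask of shape (H, W), `np.where` broadcasts over a leading channel dimension, so the same call also works for latents of shape (C, H, W).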
