Closed Little-Podi closed 10 months ago
Yes, we use DDIM inversion.
I see. Then, what is the diffusion model used to conduct the inversion? From my understanding, if the single-image diffusion model is used, it cannot faithfully reproduce the conditional view, as the final DDIM sampling is processed by the multi-view diffusion model. Am I right?
Sorry, I made a mistake in the last reply. We did not use DDIM inversion. We
similar to the inpainting process.
Sounds effective. Thanks for your detailed reply.
Hi, congrats for your excellent work! I have a question regarding view-conditioned generation in Fig. 6: I am wondering how the condition view image is provided to the denoising process. Is it generated by DDIM inversion?