roykapon / MAS

The official implementation of the paper "MAS: Multiview Ancestral Sampling for 3D Motion Generation Using 2D Diffusion"
MIT License

How Do Diffusion Models Maintain Consistency Across Views After Denoising Without Specific Conditions? #8

Closed: 2019211753 closed this issue 3 months ago

2019211753 commented 5 months ago

I understand that the input noise across the different views is consistent because it originates from projecting a single 3D noise sample. However, I'm curious how this consistency is maintained after the denoising step: how does each view continue to represent the same motion from its own perspective? I assumed that training such a diffusion model would require conditioning on the viewing angle, but according to the paper it is done unconditionally. Thank you for your clarification.
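(For readers following along, the shared-noise idea mentioned in the question can be sketched as follows. This is a minimal illustration, not the MAS codebase: the function name, orthographic cameras, and joint count are all assumptions.)

```python
import numpy as np

def project_noise(noise_3d, cameras):
    """Project one shared 3D noise sample into every view.

    noise_3d: (J, 3) Gaussian noise, one row per joint.
    cameras:  list of (2, 3) orthographic projection matrices (assumed here
              for simplicity; the paper's cameras may differ).
    Returns a list of (J, 2) per-view noise tensors that are correlated
    because they all come from the same underlying 3D sample.
    """
    return [noise_3d @ P.T for P in cameras]

rng = np.random.default_rng(0)
noise_3d = rng.standard_normal((22, 3))  # one shared 3D noise sample (22 joints assumed)
cameras = [
    np.eye(2, 3),                              # toy front view: keeps (x, y)
    np.array([[0., 0., 1.], [0., 1., 0.]]),    # toy side view: keeps (z, y)
]
per_view = project_noise(noise_3d, cameras)
```

Because both toy cameras share the vertical axis, the y-component of the noise is identical across the two views, which is exactly the kind of cross-view correlation the question refers to.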

roykapon commented 4 months ago

Hi there @2019211753! This is exactly the beauty of the method: it does not require viewing-angle conditioning. What keeps all views consistent with each other is our consistency block. At each denoising stage it takes the motion predictions from all views (which are not necessarily multiview-consistent), triangulates them into a single 3D motion, and then projects that motion back to all views, replacing the original predictions with the projections. These projections are what feed the diffusion process in each view, so the entire process remains multiview-consistent. The 3D noise provides a crucial boost to the coordination between the views, but it is the consistency block that keeps them together.
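(Editor's note: the triangulate-then-reproject step described above can be sketched roughly as below. This is a hedged illustration, not the actual MAS implementation: the function name, orthographic (2, 3) cameras, and least-squares triangulation are all simplifying assumptions.)

```python
import numpy as np

def consistency_block(preds, cameras):
    """Triangulate per-view 2D predictions into one 3D motion, then reproject.

    preds:   list of (J, 2) per-view 2D joint predictions.
    cameras: list of (2, 3) orthographic projection matrices (assumed here;
             the paper's camera model may differ).
    Returns the (J, 3) triangulated joints and the list of (J, 2)
    reprojections that replace the original per-view predictions.
    """
    A = np.vstack(cameras)                            # (2V, 3) stacked projections
    B = np.concatenate([p.T for p in preds], axis=0)  # (2V, J) stacked observations
    X, *_ = np.linalg.lstsq(A, B, rcond=None)         # (3, J) least-squares 3D fit
    points = X.T                                      # (J, 3) triangulated joints
    reprojected = [points @ P.T for P in cameras]     # consistent per-view preds
    return points, reprojected
```

If the per-view predictions already agree with some underlying 3D motion, the block recovers it exactly; when they disagree, the least-squares fit snaps all views onto the closest single 3D motion, which is what keeps the denoising trajectories coordinated.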