google-research / pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
Apache License 2.0
857 stars 71 forks source link

scale factor b #54

Open zhangdan8962 opened 6 months ago

zhangdan8962 commented 6 months ago

@chentingpc You mentioned scale the input image x by factor b when we calculate the m_bits in Algorithm 1. I am wondering if we need to scale it back when we predict m_0 from m_t during sampling because it is not so obvious in your Algorithm 2. Did you actually implement it in DDIM sampler? I have similar question regarding your another paper On the Importance of Noise Scheduling for Diffusion Models.

If such inverted scaling is not necessary, could you please explain the reason behind this?