Closed YongtaoGe closed 7 months ago
Hey, thank you for your interest in this project! One thing is that we use a fixed timestep 400 in q_sample
, since this is just responsible for noise augmentation. And also, we pass the image also as a conditioning to the UNet. As a result, the second to the last line should be something like pred = UNet(xt, t, rgb_latents)
@YongtaoGe , did you manage to run successful training ? If so, can you please share the code ?
Hi, authors! Thanks for open-source the inference code. I am interested in reimplementing the training process. However, the results look weird. Here is the code snippet I use, it would be appreciated if you could help me check the problem in the code.