Questions about inpaint training code

lucidrains / DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Questions about inpaint training code #249

Open universewill opened 2 years ago

universewill commented 2 years ago

Hi, I don't quite understand the inpainting training framework.

Does only the decoder need to be trained?
What does "mock_image_embed" mean in the snippet below? Does this embedding need to be generated by CLIP and the diffusion prior model?

inpainted_images = decoder.sample(
    image_embed = mock_image_embed,
    inpaint_image = inpaint_image,    # just pass in the inpaint image
    inpaint_mask = inpaint_mask       # and the mask
)

zhangchen234 commented 2 years ago

I have the same question.