question regarding diffusion model

revalo / tree-diffusion

Diffusion on syntax trees for program synthesis

MIT License

403 stars, 22 forks

Thank you! I'm glad you find this work exciting :)

1) The model is given both the current render and the target render, so it doesn't really need to be told step_i: that should be derivable from the current and the target. This makes sampling a lot easier, since the noise level is estimated by the model itself.
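To make the point about step_i concrete, here is a toy sketch of how the gap between the two renders already tells you roughly how far there is left to go. This is not how the model computes anything (it learns this implicitly); the `Image` type and `estimated_noise_level` are hypothetical names used only for illustration.

```python
from typing import List

# Hypothetical stand-in for a rendered image: a small binary raster.
Image = List[List[int]]


def estimated_noise_level(current: Image, target: Image) -> float:
    """Fraction of pixels where the two renders disagree.

    A crude proxy for "how noisy is the current program", computed
    only from the current and target renders, with no step index.
    """
    total = sum(len(row) for row in target)
    differing = sum(
        c != t
        for current_row, target_row in zip(current, target)
        for c, t in zip(current_row, target_row)
    )
    return differing / total


current = [[0, 0], [0, 0]]
target = [[0, 1], [1, 1]]
print(estimated_noise_level(current, target))
```

A sampler built this way never has to pass a timestep to the model, which is the simplification described in point 1.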

2) If we just ran the denoising process, we would get a distribution of programs that could satisfy the target. Since our evals are formal, where we have to match the image almost exactly, we stop once we have a match. In fact, we can even do tree search on this model.
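A minimal sketch of "denoise, but stop on a match" might look like the following. Everything here is an assumption made for illustration: programs are tuples of ints, the render is the identity, the match is exact equality rather than "almost exactly", and `toy_edit` stands in for the learned denoiser. None of these names come from the repository.

```python
import random
from typing import Callable, Optional, Tuple

Program = Tuple[int, ...]  # hypothetical stand-in for a syntax tree
Render = Tuple[int, ...]   # hypothetical stand-in for an image


def sample_until_match(
    init: Program,
    target: Render,
    render: Callable[[Program], Render],
    propose_edit: Callable[[Program, Render, Render, random.Random], Program],
    max_steps: int = 100,
    seed: int = 0,
) -> Optional[Program]:
    """Apply edits until the render equals the target or steps run out."""
    rng = random.Random(seed)
    program = init
    for _ in range(max_steps):
        current = render(program)
        if current == target:
            return program  # early stop: the evaluation is already satisfied
        program = propose_edit(program, current, target, rng)
    return program if render(program) == target else None


def toy_render(program: Program) -> Render:
    # Assumption: the "image" is just the program itself.
    return tuple(program)


def toy_edit(program: Program, current: Render, target: Render,
             rng: random.Random) -> Program:
    # Assumption: a perfect denoiser that fixes one mismatched slot per step.
    wrong = [i for i, (c, t) in enumerate(zip(current, target)) if c != t]
    i = rng.choice(wrong)
    return program[:i] + (target[i],) + program[i + 1:]


print(sample_until_match((0, 0, 0), (1, 2, 3), toy_render, toy_edit))
```

Because each step only needs the current program and the target, the same loop can be turned into a tree search by keeping several candidate programs instead of one.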

The method presented here is a discrete diffusion model (kind of like DiGress), where the transformer decoder is just modeling the discrete jumps using an autoregressive approach. But we do add discrete Markovian noise, and we do have a model that learns to undo this noise.
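As a rough picture of what "discrete Markovian noise" plus "learning to undo it" can mean, here is a simplified sketch. It uses a flat list of leaf tokens instead of a real syntax tree, and the vocabulary, `noise_step`, `forward_chain`, and `apply_edit` are all hypothetical names, not the repository's API. Each noise step depends only on the current state, and each step records the single edit that reverses it, which is the kind of target a denoiser could be trained to predict.

```python
import random
from typing import List, Tuple

VOCAB = ["circle", "square", "triangle"]  # hypothetical leaf vocabulary
Edit = Tuple[int, str]                    # (position, value to write there)


def apply_edit(tokens: List[str], edit: Edit) -> List[str]:
    """Return a copy of tokens with one position overwritten."""
    i, value = edit
    return tokens[:i] + [value] + tokens[i + 1:]


def noise_step(tokens: List[str], rng: random.Random) -> Tuple[List[str], Edit]:
    """One discrete jump: replace a random leaf with a different token.

    Returns the noised tokens and the edit that undoes this jump.
    """
    i = rng.randrange(len(tokens))
    old = tokens[i]
    new = rng.choice([v for v in VOCAB if v != old])
    return apply_edit(tokens, (i, new)), (i, old)


def forward_chain(tokens: List[str], steps: int,
                  rng: random.Random) -> Tuple[List[List[str]], List[Edit]]:
    """Run a Markov chain of noise steps, keeping every state and undo edit."""
    states, undo_edits = [tokens], []
    for _ in range(steps):
        tokens, undo = noise_step(tokens, rng)
        states.append(tokens)
        undo_edits.append(undo)
    return states, undo_edits


rng = random.Random(0)
clean = ["circle", "square", "circle"]
states, undo_edits = forward_chain(clean, steps=4, rng=rng)

# Replaying the undo edits in reverse order recovers the clean tokens.
recovered = states[-1]
for edit in reversed(undo_edits):
    recovered = apply_edit(recovered, edit)
print(recovered == clean)
```

In the actual method the reverse edits are not recorded and replayed but predicted by the transformer decoder from the current and target renders.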


question regarding diffusion model #10