amazon-science / tabsyn

Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""
Apache License 2.0
75 stars 27 forks source link

Should z (input of the diffusion process) be a sampled version or deterministic #9

Open JiajianLu opened 6 months ago

JiajianLu commented 6 months ago

Hi, from this line https://github.com/amazon-science/tabsyn/blob/main/tabsyn/vae/model.py#L367, each data x is encoded to a fixed z since it's only using the mu encoder. Should it be a sampled version? In other latent diffusion implementation, it seems a sampled version is used. thanks

hengruizhang98 commented 6 months ago

I didn't try using the sampled version. There might be slights gap between their empirical performance. Thanks for your question and I will try it later.