ChunyuanLI / Optimus

Optimus: the first large-scale pre-trained VAE language model
368 stars 37 forks source link

additional sampling scheme #6

Open vseledkin opened 4 years ago

vseledkin commented 4 years ago

Currently, text is generated from latent point by sampling from distributions produced by generator over vocabulary of tokens. But since z is multivariate gaussian we can also sample from it thus having more diversity and nuances in generated samples.

ChunyuanLI commented 4 years ago

You are right about my current implementation, and I agree it will increase diversity. I‘m bit afraid that adding randomness in latent space may delude the latent trajectory, one has to carefully choose random samples by checking its distance to its Gaussian mean.