Open enolan opened 3 years ago
working with pixels we want to output a distribution - let's say it's a multivariate normal. We can then implement an equivalent to top-p sampling by transforming the output distributions into truncated normal distributions, using the cdf to find the correct truncation point. See here for p = 0.05.
This becomes more complicated if we make the r/g/b covariance non-identity, if it's hard I think it's fine to constrain to identity covariance since we're sampling multiple times with MC-dropout anyway.
should get a big improvement from working from pixels rather than vqgan