-
Hi,
Could you please point out where the gumbel softmax is applied in the code? I am unable to find it.
Thanks
-
Hi, I want to ask that what is the strategy at inference stage ? The method utilizes gumbel softmax to generate preserve token mask at training stage, but how to generate preserve token mask at infere…
-
Thanks for the code! I'm trying to learn Julia and Flux.jl but I'm having trouble finding an example of a VAE with gumbel-softmax trick. Chances are that you have already done this. Could you provide …
-
Hi, I find the code in rlattention.py : 'from thumt.layers.gumbel import gumbel_softmax' But in the layers fload, there isn't the Class gumbel ?
-
Could you please share details on gumbel softmax?
How did you incorporate in your project?
Parameters like tau, hard etc?
-
The equation (1) in this [paper](https://arxiv.org/pdf/1611.01144.pdf) gives the equation of a sampler that samples from a categorical distributions. The advantage of this discrete sampler over the ot…
-
Hello, I have questions on exploration and Gumbel-Softmax.
In the pseudocode, it mentioned initialize random process N for action exploration, which is same in the paper of DDPG. But I have difficu…
-
Hi, nice work! I am a bit confused about gumbel softmax. You mention in your paper that, during traininig, gumbel softmax is used. I wonder if it can be replaced by pure softmax (i.e. torch.softmax)? …
-
It doesn’t look like the temperature is annealed in your gumbel softmax. Is there a reason for this as it is not standard? @tkipf
-
For many bayesian scientists [and for one of my recent application domains] there's recently been a lot of hype around [this article](https://arxiv.org/abs/1611.01144) about how to learn categorical v…