-
Hi @HEmile, I tested the example in examples/vae/discrete_vae.py, but found that Gumbel-Softmax performs much better than REBAR, RELAX, and REINFORCE (test loss after 10 epochs: 98 for Gumbel, 165 for…
-
I read that you apply bivariate Gumbel sampling in your paper and use a generalized form of Gumbel-Softmax.
Gumbel-Softmax takes logits (log-probabilities) as input, while you directly use learned st…
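For what it's worth, here is a minimal NumPy sketch (not the repo's actual code; all names are illustrative) of why parameters learned as probabilities need a `log` before being fed to a Gumbel-Softmax sampler as logits:

```python
import numpy as np

def gumbel_softmax_sample(logits, tau=1.0, rng=None):
    """One relaxed categorical sample from unnormalized log-probabilities."""
    rng = np.random.default_rng() if rng is None else rng
    # Gumbel(0, 1) noise via the inverse-CDF trick.
    gumbel = -np.log(-np.log(rng.uniform(size=logits.shape)))
    y = (logits + gumbel) / tau
    y = y - y.max()  # numerical stabilization; softmax is shift-invariant
    e = np.exp(y)
    return e / e.sum()

# Parameters learned as probabilities must be mapped to log space first:
probs = np.array([0.7, 0.2, 0.1])
sample = gumbel_softmax_sample(np.log(probs), tau=0.5, rng=np.random.default_rng(0))
```

Passing the raw probabilities directly as logits would silently sample from a different (much flatter) distribution.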
-
```python
class TPS(M.Module):
    def __init__(self, variant='dTPS'):
        ...

    def forward(self, reserved, pruned, now_reserved_policy, now_pruned_policy):
        ...
        B, N, _ = reserve…
```
-
`logits_py` should be the log of the current `logits_py`, or we can simply pass it as `probs` rather than `logits` to the corresponding distribution.
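A quick self-contained check (plain NumPy, illustrative names) of why the two parameterizations are interchangeable:

```python
import numpy as np

def softmax(x):
    z = x - x.max()
    e = np.exp(z)
    return e / e.sum()

p = np.array([0.5, 0.3, 0.2])
# softmax(log(p)) recovers p exactly, which is why `probs=p` and
# `logits=np.log(p)` parameterize the same categorical distribution.
recovered = softmax(np.log(p))
```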
-
Even after a longer run, the agents don't learn.
According to PressurePlate, the reward is in [-0.9, 0] if the agent is in the same room as its assigned plate, and in [-1, ..., -N] otherwise.
I tri…
-
Following the #548 discussion, and while we wait for discrete latent variables, it would be nice to have a Gumbel-Softmax categorical approximation as featured in Pyro. I didn't realize this was the name gi…
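A minimal NumPy sketch of the relaxed one-hot categorical (Concrete) sampler being requested, just to illustrate the temperature behavior; this is not Pyro's implementation:

```python
import numpy as np

def relaxed_one_hot(logits, tau, rng):
    """Sample from the Concrete / relaxed one-hot categorical distribution."""
    g = -np.log(-np.log(rng.uniform(size=logits.shape)))
    y = (logits + g) / tau
    y = y - y.max()
    e = np.exp(y)
    return e / e.sum()

logits = np.log(np.array([0.6, 0.3, 0.1]))
# Same Gumbel noise at two temperatures: low tau pushes the sample toward
# a one-hot vector, high tau flattens it toward the uniform distribution.
hot = relaxed_one_hot(logits, tau=0.01, rng=np.random.default_rng(0))
soft = relaxed_one_hot(logits, tau=5.0, rng=np.random.default_rng(0))
```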
-
What TensorFlow version is needed to run the MNIST example? Thank you!
-
Thanks for your documentation and transparency here.
Quick question: in `sample_from_softmax(logits, disallow=None)`, you return:
`tf.one_hot(tf.argmax(tf.nn.softmax(logits + gumbel_noise), -1, …`
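For context on that line: softmax is strictly increasing elementwise, so the inner `tf.nn.softmax` cannot change the result of the `argmax`. A small NumPy check of that claim (illustrative only, not the project's code):

```python
import numpy as np

rng = np.random.default_rng(42)
logits = rng.normal(size=(4, 10))
gumbel = -np.log(-np.log(rng.uniform(size=logits.shape)))

def softmax(x):
    z = x - x.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

# softmax preserves the ordering within each row,
# so it never changes which index is largest:
with_softmax = np.argmax(softmax(logits + gumbel), axis=-1)
without_softmax = np.argmax(logits + gumbel, axis=-1)
```

So the softmax there appears redundant for sampling, though it may be kept for readability or reuse of the probabilities elsewhere.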
-
Hi Shariq,
In your code you update the value function with actions computed by:
1) [gumbel_softmax](https://github.com/shariqiqbal2810/maddpg-pytorch/blob/40388d7c18e4662cf23c826d97e209df9003d86c/…
-
### 📚 The doc issue
[Gumbel-Softmax documentation](https://pytorch.org/docs/stable/generated/torch.nn.functional.gumbel_softmax.html) states that the ``logits`` argument should be unnormalized. Howev…
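A small NumPy check (not the PyTorch implementation; names are illustrative) of why normalized and unnormalized logits give identical Gumbel-Softmax samples: the two differ only by an additive constant, which softmax ignores.

```python
import numpy as np

def softmax(x):
    z = x - x.max()
    e = np.exp(z)
    return e / e.sum()

rng = np.random.default_rng(1)
raw = np.array([2.0, 0.5, -1.0])          # unnormalized scores
norm = raw - np.log(np.exp(raw).sum())    # log-softmax: normalized log-probs
g = -np.log(-np.log(rng.uniform(size=3))) # shared Gumbel noise

# Shifting logits by a constant shifts (logits + g) / tau by a constant,
# and softmax is shift-invariant, so the relaxed samples coincide exactly:
tau = 0.5
same = np.allclose(softmax((raw + g) / tau), softmax((norm + g) / tau))
```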