Owen-Liuyuxuan / papers_reading_sharing.github.io

Sites to share deep learning related papers and their digests
https://owen-liuyuxuan.github.io/papers_reading_sharing.github.io/

Categorical Reparameterization with Gumbel-Softmax comments #5

Open utterances-bot opened 1 year ago

utterances-bot commented 1 year ago

Categorical Reparameterization with Gumbel-Softmax - Reading Collections

https://owen-liuyuxuan.github.io/papers_reading_sharing.github.io/Building_Blocks/GumbelSoftmax/

river-mz commented 1 year ago

Hello, I read your blog and learned a lot from it. My understanding of Gumbel-Softmax is that by tuning the temperature you can push the distribution toward a uniform or a one-hot form, while it still agrees with the original distribution overall (for example, at non-extreme temperatures the position of the maximum should stay the same). However, when I actually tried PyTorch's implementation, the distribution changed substantially before and after applying Gumbel-Softmax. Code and output below:

```python
import torch
import torch.nn.functional as F

logits = torch.randn(1, 5)
logits = torch.softmax(logits, dim=-1)
print(logits)
soft = F.gumbel_softmax(logits, tau=1, hard=False)
print(soft)
```

```
tensor([[0.2096, 0.1859, 0.1033, 0.1246, 0.3767]])
tensor([[0.0823, 0.1950, 0.1542, 0.4819, 0.0865]])
```

I'm quite confused by this result. Is my understanding of Gumbel-Softmax mistaken?
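A sketch that may explain the discrepancy (an illustration, not a reply from the thread): `F.gumbel_softmax` interprets its input as unnormalized *log*-probabilities, so passing softmax probabilities directly distorts the distribution, and each call additionally draws one *random* sample, so no single output is expected to match the input. Feeding `log`-probabilities and averaging many hard (one-hot) samples recovers the original distribution; the probability vector below is taken from the comment above.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)  # make the random draws reproducible

# Probabilities from the comment above; gumbel_softmax expects log-probs.
probs = torch.tensor([[0.2096, 0.1859, 0.1033, 0.1246, 0.3767]])
logits = probs.log()

# Draw many hard (one-hot) samples; their average is the empirical
# categorical distribution, which should approach `probs`.
samples = torch.stack(
    [F.gumbel_softmax(logits, tau=1.0, hard=True) for _ in range(10000)]
)
mean = samples.mean(dim=0)
print(mean)  # close to probs, with the maximum still at index 4
```

With `hard=True`, the forward value of each sample is an exact draw from the categorical distribution (the Gumbel-max trick), so the empirical frequencies converge to the input probabilities as the sample count grows; a single `hard=False` sample, by contrast, is a random point on the simplex and will routinely have its maximum elsewhere.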