Closed Rui-Chun closed 5 years ago
logits
are the values you would apply a softmax over to get the probabilities. They're not quite probabilities themselves (in particular, these values might be negative or don't sum to 1), and that's why we use Categorical(logits=logits)
. Categorical
is responsible for applying that softmax operation.
At the end of categorical_mlp.py, the forward function return the above result.
But should not it be " return Categorical(logits) ", since logits means the probability, right?