AIchberger opened this issue 1 year ago
Hi @lorenzkuhn, I have the same question about the semantic entropy computation in the repo compared to its description in the paper. Could you please explain the intuition behind implementing it differently from what the paper proposes?
Thanks!
Hi @lorenzkuhn,
I have a concern regarding the computation of the semantic entropy in the function `get_predictive_entropy_over_concepts`:

```python
llh_shift = torch.tensor(5.0)
aggregated_likelihoods = torch.tensor(aggregated_likelihoods) - llh_shift
entropy = - torch.sum(aggregated_likelihoods, dim=0) / torch.tensor(aggregated_likelihoods.shape[0])
```
In this case, `aggregated_likelihoods` represents the log-likelihoods of the semantic sets, `log p(c|x)`, shifted by the constant `llh_shift`. The entropy is then computed as the average negative log-likelihood across the semantic sets, which does not follow the standard definition of entropy, `-sum_c p(c|x) * log p(c|x)`. Could you elaborate on this in more detail?
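To make the discrepancy concrete, here is a minimal sketch comparing the two quantities. The probabilities below are purely illustrative (not taken from the repo), and the repo's `llh_shift` offset is omitted since it only adds a constant:

```python
import math

# Hypothetical probabilities of three semantic sets p(c|x) -- illustrative only.
probs = [0.5, 0.3, 0.2]
log_probs = [math.log(p) for p in probs]

# Repo-style computation: average negative log-likelihood over the sets,
# i.e. -(1/C) * sum_c log p(c|x).
avg_nll = -sum(log_probs) / len(log_probs)

# Standard entropy: -sum_c p(c|x) * log p(c|x).
entropy = -sum(p * lp for p, lp in zip(probs, log_probs))

print(avg_nll, entropy)  # ~1.1689 vs ~1.0297 -- the two quantities differ
```

The average negative log-likelihood would only match entropy in expectation if the semantic sets were sampled in proportion to `p(c|x)`; computed over the distinct sets directly, it weights every set equally rather than by its probability.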