What `Distributions.pdf` returns, and when?

aplavin commented 9 months ago

The docstring simply states:

  pdf(d::UnivariateDistribution, x::Real)

  Evaluate the probability density (mass) at x.

This surely isn't enough to know when it returns density, and when mass. I thought I kinda knew the answer to this question, but recently turned out I was wrong.

I've always thought it returns density for Continous distributions, and mass for Discrete. But actually there are distributions for which pdf(d, x) sometimes returns mass and sometimes density – for the same d. One example is censored(...) distribution.

Would be nice to add some clarification to the pdf docstring to make writing generic code feasible. Currently, pdf doesn't guarantee anything useful unless one only considers specific individual distributions. Code like f(d) = ... pdf(d, x) ... without explicit constrains on d is impossible to reason about.

devmotion commented 9 months ago

It always returns "the" probability density, the main question is just with respect to which base measure. Typically for discrete discrete distributions the base measure is the counting measure, which implies that the density function coincides with the probability mass function (see eg https://en.wikipedia.org/wiki/Probability_mass_function#Measure_theoretic_formulation).

aplavin commented 9 months ago

That's what I thought originally: regular reals measure for Continous, counting measure for Discrete. But what the measure is for censored() then?

sethaxen commented 9 months ago

Censored is an example of a probability measure of mixed type (see e.g. https://www.randomservices.org/random/dist/Mixed.html). While a density function is generally a Radon-Nikodym derivative of a probability measure wrt another measure (which is the measure wrt which it is absolutely continuous), I don't think this concept extends easily to mixed type probability measures. However, they have a clear notion of partial density, which depending on the base measure would give either the density of the continuous component or the discrete component.

For mixture distributions in general it might be more sensible to define the density as the probability-weighted sum of densities of the components wrt their respective measures instead of defining as absolutely continuous wrt some base measure. This definition would still support inference using censored data.

aplavin commented 9 months ago

That's a bit over my head... Let me just share my practical concern and motivation for opening this issue.

It was nice to see that Distributions defines the censored distribution with an interface seemingly consistent with all other distributions. But then I noticed that it's basically impossible to use this distribution with a generic function that does some computations based of pdf/logpdf. For example, plugging censored(...) as part of the prior or likelihood into a bayesian analysis procedure would clearly lead to wrong inferences. pdf(d, x) of d = censored(Normal(0, 1), 0, Inf) is effectively indistinguishable from truncated(Normal(0, 1), 0, Inf) aside from 2x different normalization, and inference results will be the same for both.

Do you think it is possible at all to use Distributions.pdf generically? If so, how to avoid this kind of silently wrong results?