Compound probability distribution
In probability theory, a compound distribution is the probability distribution that results from assuming that a random variable is distributed according to some parameterized distribution with an unknown parameter that is distributed according to some other distribution , and then determining the distribution that results from marginalizing over (i.e. integrating the unknown parameter out). The resulting distribution is said to be the distribution that results from compounding with . Typically, is assumed to be the conjugate prior of .
Examples:
- Compounding a Gaussian distribution with mean distributed according to another Gaussian distribution yields a Gaussian distribution.
- Compounding a Gaussian distribution with precision (inverse variance) distributed according to a gamma distribution yields a three-parameter Student's t distribution.
- Compounding a binomial distribution with probability of success distributed according to a beta distribution yields a beta-binomial distribution.
- Compounding a multinomial distribution with probability vector distributed according to a Dirichlet distribution yields a multivariate Polya distribution, also known as a Dirichlet compound multinomial distribution.
Note that the support of the resulting compound distribution is the same as the support of the original distribution . For example, a beta-binomial distribution is discrete just as the binomial distribution is (however, its shape is similar to that of a beta distribution). The variance of the compound distribution is typically greater than the variance of the original distribution . The parameters of include the parameters of and any parameters of that are not marginalized out. For example, the beta-binomial distribution includes three parameters, a parameter (number of samples) from the binomial distribution and shape parameters and from the beta distribution.
A related but slightly different concept of "compound" occurs in the compound Poisson distribution, which is actually a set of distributions defined by a parameter that is itself a distribution. These distributions result from considering a set of independent identically-distributed random variables distributed according to and asking what the distribution of their sum is, if the number of variables is itself an unknown random variable distributed according to a Poisson distribution and independent of the variables being summed. In this case the random variable is marginalized out much like above is marginalized out.