In statistics and econometrics, the multivariate probit model is a generalization of the probit model used to estimate several correlated binary outcomes jointly. For example, if it is believed that the decisions of sending at least one child to public school and that of voting in favor of a school budget are correlated (both decisions are binary), then the multivariate probit model would be appropriate for jointly predicting these two choices on an individual-specific basis.
Example: bivariate probit
In the ordinary probit model, there is only one binary dependent variable
and so only one latent variable
is used. In contrast, in the bivariate probit model there are two binary dependent variables
and
, so there are two latent variables:
and
.
It is assumed that each observed variable takes on the value 1 if and only if its underlying continuous latent variable takes on a positive value:


with

and

Fitting the bivariate probit model involves estimating the values of
and
. To do so, the likelihood of the model has to be maximized. This likelihood is
![{\displaystyle {\begin{aligned}L(\beta _{1},\beta _{2})={\Big (}\prod &P(Y_{1}=1,Y_{2}=1\mid \beta _{1},\beta _{2})^{Y_{1}Y_{2}}P(Y_{1}=0,Y_{2}=1\mid \beta _{1},\beta _{2})^{(1-Y_{1})Y_{2}}\\[8pt]&{}\qquad P(Y_{1}=1,Y_{2}=0\mid \beta _{1},\beta _{2})^{Y_{1}(1-Y_{2})}P(Y_{1}=0,Y_{2}=0\mid \beta _{1},\beta _{2})^{(1-Y_{1})(1-Y_{2})}{\Big )}\end{aligned}}}](/media/api/rest_v1/media/math/render/svg/cf38bbe4b310e647267a617ca2a58c4064eebf5a)
Substituting the latent variables
and
in the probability functions and taking logs gives
![{\displaystyle {\begin{aligned}\sum &{\Big (}Y_{1}Y_{2}\ln P(\varepsilon _{1}>-X_{1}\beta _{1},\varepsilon _{2}>-X_{2}\beta _{2})\\[4pt]&{}\quad {}+(1-Y_{1})Y_{2}\ln P(\varepsilon _{1}<-X_{1}\beta _{1},\varepsilon _{2}>-X_{2}\beta _{2})\\[4pt]&{}\quad {}+Y_{1}(1-Y_{2})\ln P(\varepsilon _{1}>-X_{1}\beta _{1},\varepsilon _{2}<-X_{2}\beta _{2})\\[4pt]&{}\quad {}+(1-Y_{1})(1-Y_{2})\ln P(\varepsilon _{1}<-X_{1}\beta _{1},\varepsilon _{2}<-X_{2}\beta _{2}){\Big )}.\end{aligned}}}](/media/api/rest_v1/media/math/render/svg/06d3be7e06c333f9229c4a72ce2d71f7b8d9b516)
After some rewriting, the log-likelihood function becomes:
![{\displaystyle {\begin{aligned}\sum &{\Big (}Y_{1}Y_{2}\ln \Phi (X_{1}\beta _{1},X_{2}\beta _{2},\rho )\\[4pt]&{}\quad {}+(1-Y_{1})Y_{2}\ln \Phi (-X_{1}\beta _{1},X_{2}\beta _{2},-\rho )\\[4pt]&{}\quad {}+Y_{1}(1-Y_{2})\ln \Phi (X_{1}\beta _{1},-X_{2}\beta _{2},-\rho )\\[4pt]&{}\quad {}+(1-Y_{1})(1-Y_{2})\ln \Phi (-X_{1}\beta _{1},-X_{2}\beta _{2},\rho ){\Big )}.\end{aligned}}}](/media/api/rest_v1/media/math/render/svg/60a4fbbede93a87fb1e470ac115113986d40c3bd)
Note that
is the cumulative distribution function of the bivariate normal distribution.
and
in the log-likelihood function are observed variables being equal to one or zero.
Multivariate Probit
For the general case,
where we can take
as choices and
as individuals or observations, the probability of observing choice
is

, else
Further reading
Greene, William H., Econometric Analysis, seventh edition, Prentice-Hall, 2012.