Multinomial test
![]() |
In statistics, the multinomial test is the likelihood-ratio test of the null hypothesis that the parameters of a multinomial distribution equal specified values. It is used for categorical data; see Read and Cressie[1].
We begin with a sample of items each of which has been observed to fall into one of categories. We can define as the observed numbers of items in each cell. Hence .
Next, we define a vector of parameters , where . These are the parameter values under the null hypothesis.
The exact probability of the observed configuration under the null hypothesis is given by
Under the alternative hypothesis, each value is replaced by its unconstrained maximum likelihood estimate and the exact probability of the observed configuration under the alternative hypothesis is given by
The natural logarithm of the ratio between these two probabilities multiplied by -2 is then the likelihood ratio test statistic
If the null hypothesis is true, then as increases, the distribution of converges to that of chisquare with degrees of freedom. However it has long been known (eg Lawley 1956) that for finite sample sizes, the moments of are greater than those of chisquare, thus inflating the probability of Type I errors (false positives). The speed with which the test statistic converges to chisquare is determined by . Williams (1976) showed that it can be made to converge to chisquare at a rate determined by if it is multiplied by a factor given by
In the special case where the null hypothesis is that all the values are equal to 1/k (ie it stipulates a uniform distribution), this simplifies to
Subsequently, Smith et al (1981) derived a multiplier which converges the test statistic at a rate determined by though this is more complicated to apply in practice. For the case of equal vales of , the multiplier is
References
- ^ Read, T. R. C. and Cressie, N. A. C. (1988). Goodness-of-fit statistics for discrete multivariate data. New York: Springer-Verlag. ISBN 0-387-96682-X.
- Lawley, D. N. (1956). "A General Method of Approximating to the Distribution of Likelihood Ratio Criteria". Biometrika. 43: 295–303.
- Smith, P. J., Rae, D. S., Manderscheid, R. W. and Silbergeld, S. (1981). "Approximating the Moments and Distribution of the Likelihood Ratio Statistic for Multinomial Goodness of Fit". Journal of the American Statistical Association. 76: 737–740.
{{cite journal}}
: CS1 maint: multiple names: authors list (link) - Williams, D. A. (1976). "Improved Likelihood Ratio Tests for Complete Contingency Tables". Biometrika. 63: 33–37.