This article needs attention from an expert in statistics. The specific problem is: no source, and notation/definition problems regarding L.WikiProject Statistics may be able to help recruit an expert.
In statistics the mean squared prediction error of a smoothing or curve fitting procedure is the expected value of the squared difference between the fitted values implied by the predictive function and the values of the (unobservable) function g. It is an inverse measure of the explanatory power of and can be used in the process of cross-validation of an estimated model.
If the smoothing or fitting procedure has operator matrix (i.e., hat matrix) L, which maps the observed values vector to predicted values vector via then
The MSPE can be decomposed into two terms (just like mean squared error is decomposed into bias and variance); however, for MSPE one term is the sum of squared biases of the fitted values and the other is the sum of variances of the fitted values:
Knowledge of g is required in order to calculate MSPE exactly.
Estimation of MSPE
For the model where , one may write
The first term is equivalent to
Thus,
If is known or well-estimated by , it becomes possible to estimate MSPE by
Colin Mallows advocated this method in the construction of his model selection statistic Cp, which is a normalized version of the estimated MSPE:
where p comes from the fact that the number of parameters p estimated for a parametric smoother is given by , and C is in honor of Cuthbert Daniel.[citation needed]