Endogeneity with an exponential regression function
In statistics as applied to econometrics, exponential regression models constitute a large and popular class of regression models. Standard econometric concerns such as endogeneity or omitted variables can be accounted for in a more general framework. Wooldridge and Terza provide a methodology to both deal with and test for endogeneity within the exponential regression framework, and the discussion below follows their approach closely.[1] While the example focuses on a Poisson regression model, it is possible to generalize the test to other exponential regression models, although this may come at the cost of additional assumptions (e.g. for binary response or censored data models).
Assume the following exponential regression model, where $c_i$ is an unobserved term in the latent variable. We allow for correlation between $c_i$ and $x_i$ (implying that $x_i$ is possibly endogenous), but rule out any such correlation between $c_i$ and $z_i$.
- $E[y_i \mid x_i, z_i, c_i] = \exp(x_i b_0 + z_i c_0 + c_i)$ (1)
The variables $z_i$ serve as instrumental variables for the potentially endogenous $x_i$. One can assume a linear relationship between these two variables or, alternatively, project the endogenous variable $x_i$ onto the instruments to get the following reduced form equation, with reduced form error $v_i$:
- $x_i = z_i \Pi + v_i$ (2)
The usual rank condition is needed to ensure identification. The endogeneity is then modeled in the following way, where $\rho$ determines the severity of endogeneity and $v_i$ is assumed to be independent of $e_i$.
- $c_i = v_i \rho + e_i$ (3)
Imposing these assumptions, assuming the models are correctly specified, and normalizing $E[\exp(e_i)] = 1$, we can rewrite the conditional mean as follows:
- $E[y_i \mid x_i, z_i, v_i] = \exp(x_i b_0 + z_i c_0 + v_i \rho)$ (4)
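The step to the rewritten conditional mean can be spelled out with the law of iterated expectations. The sketch below uses the notation $y_i$ (outcome), $x_i$ (potentially endogenous regressor), $z_i$ (instruments), $v_i = x_i - z_i \Pi$ (reduced form error) and $c_i = v_i \rho + e_i$. It additionally assumes that $e_i$ is independent of $(z_i, v_i)$ jointly, which is a somewhat stronger reading of the independence assumption than the one stated in the text.

```latex
\begin{aligned}
E[y_i \mid x_i, z_i, v_i]
  &= E\big[\, E[y_i \mid x_i, z_i, c_i] \;\big|\; x_i, z_i, v_i \big]
     && \text{($v_i$ is a function of $x_i$ and $z_i$)} \\
  &= \exp(x_i b_0 + z_i c_0)\, E\big[ \exp(c_i) \mid x_i, z_i, v_i \big] \\
  &= \exp(x_i b_0 + z_i c_0)\, \exp(v_i \rho)\, E\big[ \exp(e_i) \mid z_i, v_i \big]
     && \text{($c_i = v_i \rho + e_i$)} \\
  &= \exp(x_i b_0 + z_i c_0 + v_i \rho)\, E[\exp(e_i)]
     && \text{(independence, assumed)} \\
  &= \exp(x_i b_0 + z_i c_0 + v_i \rho)
     && \text{(normalization $E[\exp(e_i)] = 1$)}
\end{aligned}
```

Under this reading, $\rho = 0$ removes $v_i$ from the conditional mean, which is why a significance test on its coefficient serves as a test for endogeneity.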
If $v_i$ were known at this point, it would be possible to estimate the relevant parameters by quasi-maximum likelihood estimation (QMLE). Following a two-step procedure, Wooldridge and Terza propose estimating equation (2) by ordinary least squares (OLS). The fitted residuals $\hat{v}_i$ from this regression can then be plugged into the estimating equation (4) in place of $v_i$, and QMLE methods will lead to consistent estimators of the parameters of interest. Significance tests on $\hat{\rho}$ can then be used to test for endogeneity within the model.
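The two-step procedure can be illustrated on simulated data. The following is a minimal sketch, not the authors' implementation: the data-generating values, the variable names, and the helper functions `ols`, `poisson_qmle` and `robust_se` are all hypothetical, the instrument `z2` is assumed to be excluded from the outcome equation, and the Poisson QMLE is fitted with a plain Newton iteration. The sandwich standard error ignores the first-stage estimation error, which is assumed to be acceptable here only for testing the null hypothesis of no endogeneity.

```python
import numpy as np

def ols(X, y):
    """Least-squares coefficients for y = X b + error."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

def poisson_qmle(X, y, n_iter=100, tol=1e-10):
    """Poisson quasi-MLE with mean exp(X b), fitted by Newton's method.
    Column 0 of X is assumed to be the constant."""
    b = np.zeros(X.shape[1])
    b[0] = np.log(y.mean())            # intercept-only starting value
    for _ in range(n_iter):
        mu = np.exp(X @ b)
        score = X.T @ (y - mu)
        hessian = X.T @ (X * mu[:, None])
        step = np.linalg.solve(hessian, score)
        b = b + step
        if np.max(np.abs(step)) < tol:
            break
    return b

def robust_se(X, y, b):
    """Sandwich standard errors for the Poisson QMLE (first stage ignored)."""
    mu = np.exp(X @ b)
    A_inv = np.linalg.inv(X.T @ (X * mu[:, None]))
    B = X.T @ (X * ((y - mu) ** 2)[:, None])
    return np.sqrt(np.diag(A_inv @ B @ A_inv))

# --- Simulated data (all values are illustrative assumptions) ---
rng = np.random.default_rng(0)
n = 5000
ones = np.ones(n)
z1, z2 = rng.normal(size=n), rng.normal(size=n)   # z2: excluded instrument
v = rng.normal(size=n)                            # reduced form error
x = 0.5 * z1 + 1.0 * z2 + v                       # reduced form, as in (2)
s = 0.3
e = rng.normal(loc=-s**2 / 2, scale=s, size=n)    # gives E[exp(e)] = 1
c = 0.4 * v + e                                   # rho = 0.4, as in (3)
y = rng.poisson(np.exp(0.2 + 0.3 * x + 0.3 * z1 + c))

# --- Step 1: OLS of x on the instruments, keep the residuals ---
Z = np.column_stack([ones, z1, z2])
v_hat = x - Z @ ols(Z, x)

# --- Step 2: Poisson QMLE with the residuals as an extra regressor ---
W = np.column_stack([ones, x, z1, v_hat])
theta_cf = poisson_qmle(W, y)
rho_hat = theta_cf[3]
t_rho = rho_hat / robust_se(W, y, theta_cf)[3]    # test of H0: rho = 0

# For comparison: Poisson QMLE that ignores endogeneity
theta_naive = poisson_qmle(np.column_stack([ones, x, z1]), y)

print("coefficient on x, two-step:", theta_cf[1])
print("coefficient on x, naive:   ", theta_naive[1])
print("rho_hat:", rho_hat, " t-statistic:", t_rho)
```

In this design the naive estimate of the coefficient on `x` should be pulled away from the value 0.3 used in the simulation, while the two-step estimate should lie close to it, and a large t-statistic on the residual term points to endogeneity.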
The methodology proposed here is often used for exponential regression functions. However, the specific assumptions that need to be made can differ across models. Binary response models impose distributional assumptions on $y_i$ and $x_i$, whereas this model imposes independence between $v_i$ and $e_i$.
See also
- Binary response model with continuous endogenous explanatory variables
- Endogeneity in multinomial response model
References
- ^ Wooldridge 1997; Terza 1998
Bibliography
- Wooldridge, J. (1997): "Quasi-Likelihood Methods for Count Data." Handbook of Applied Econometrics, Volume 2, ed. M. H. Pesaran and P. Schmidt, Oxford, Blackwell, pp. 352–406
- Terza, J. V. (1998): "Estimating Count Models with Endogenous Switching: Sample Selection and Endogenous Treatment Effects." Journal of Econometrics (84), pp. 129–154
- Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data. MIT Press, Cambridge, Massachusetts.