Hierarchical generalized linear model
Hierarchical generalized linear models(HGLM) can be considered as an extension to generalized linear models. In generalized linear models, the error components are assumed to be independent.[1] However, this assumption does not hold in some situations, which means there exist dependencies among these error terms. Hierarchical generalized linear model allows different error components.[2] The error components can be correlated and not necessarily follow Normal distribution. When there are different clusters, that is, groups of observations, the observations in the same cluster are correlated. In fact, they are positively correlated because observations in the same cluster share some common features. In this situation, using generalized linear models and ignoring the correlated may cause problems.[3]
Overview and Model
Model
In a hierarchical model, observations are grouped into clusters, and the distribution of an observation is determined not only by common structure among all clusters but also by the specific structure of the cluster where this observation belongs. So random effect component, different for different clusters, is introduced into the model. Let be the response, be the random effect, be the link function, , and is some strictly monotone function of . In a hierarchical generalized linear model, the assumption on and need to be made:[2] and
The linear predictor is in the form:
where is the link function, , , and is a monotone function of . In this hierarchical generalized linear model, the fixed effect is described by , which is the same for all observations. The random component is unobserved and varies among clusters randomly. So takes the same value for observations in the same cluster and different values for observations in different clusters. [3]
Identifiability
Identifiability in a concept in statistics. In order to perform parameter inference, it is necessary to make sure that the identifiability property holds.[4] In the model stated above, the location of v is not identifiable, since
for constant .[2] In order to make the model identifiable, we need to impose constraints on parameters. The constraint is usually imposed on random effects, such as .[2]
Models with different distributions and link functions
By assuming different distributions of and , and using different functions of and ', we will be able to obtain different models. In hierarchical generalized linear models, the distributions of random effect do not necessarily follow normal distribution. If the distribution of is normal and the link function of is the identity function, then hierarchical generalized linear model is the same as generalized linear mixed model(GLMM).
Distributions of and can also be chosen to be conjugate, since nice properties hold and it is easier for computation and interpretation.[2] For example, if the distribution of is Poisson with certain mean, the distribution of is Gamma, and canonical log link is used, then we call the model Poisson conjugate HGLM. If follows binomial distribution with certain mean, has the conjugate beta distribution, and canonical logit link is used, then we call the model Beta conjugate model. Moreover, the mixed linear model is in fact the normal conjugate HGLM.[2]
A summary of commonly used models are:[5]
Model name | distribution of y | Link function between y and u | distribution of u | Link function between u and v |
---|---|---|---|---|
Normal conjugate | Normal | Identity | Normal | Identity |
Binomial conjugate | Binomial | Logit | Beta | Logit |
Poisson conjugate | Poisson | Log | Gamma | Log |
Gamma conjugate | Gamma | Reciprocal | Inv-gamma | Reciprocal |
Binomial GLMM | Binomial | Logit | Normal | Identity |
Poisson GLMM | Poisson | Log | Normal | Identity |
Gamma GLMM | Gamma | Log | Normal | Identity |
Fitting the hierarchical generalized linear models
Hierarchical generalized linear models are used when observations come from different clusters. There are two types of estimators: fixed effect estimators and random effect estimators, corresponding to parameters in : and in , respectively. There are different ways to obtain parameter estimates for a hierarchical generalized linear model. If only fixed effect estimators are of interests, the population-averaged model can be used. If inference is focused on individuals, random effects will have to be estimated.[3]
Examples and applications
Hierarchical generalized linear model have been used to solve real-life problems, such as engineering, business, and linguistics. For example, this method was used to analyze semiconductor manufacturing, since interrelated processes form a complex hierarchy.[6] Moreover, marketing research problems can also be analyzed by using hierarchical generalized linear models'.[7]
References
- ^ Generalized Linear Models. Chapman and Hall/CRC. 1989. ISBN 0-412-31760-5.
- ^ a b c d e f . "Hierarchical Generalized Linear Models". Journal of the Royal Statistical Society. Series B (Methodological). Vol. 58, No. 4: 619–678. 1996.
{{cite journal}}
:|volume=
has extra text (help); Check|url=
value (help); Unknown parameter|coauthors=
ignored (|author=
suggested) (help) - ^ a b c Agresti, Alan (2002). Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons, Inc. ISBN 0-471-36093-7.
- ^ "Identifiability of Parameters in Latent Structure Models with Many Observed Variables" (PDF). The Annals of Statistics. Vol. 37, No. 6A, : 3099–3132. 2009.
{{cite journal}}
:|volume=
has extra text (help); Unknown parameter|coauthors=
ignored (|author=
suggested) (help)CS1 maint: extra punctuation (link) - ^ "hglm: A Package for Fitting Hierarchical Generalized Linear Models". The R Journal. Vol. 2/2. 2010.
{{cite journal}}
:|volume=
has extra text (help); Unknown parameter|coauthors=
ignored (|author=
suggested) (help); Unknown parameter|month=
ignored (help) - ^ "Hierarchical Modeling Using Generalized Linear Models". Quality and Reliability Engineering International. 2011.
{{cite journal}}
: Unknown parameter|coauthors=
ignored (|author=
suggested) (help) - ^ "Interpreting interrelations across multiple levels in HGLM models: An application in international marketing research". International Marketing Review. Vol. 28 Iss: 1, : pp.34 - 56. 2011.
{{cite journal}}
:|pages=
has extra text (help);|volume=
has extra text (help); Unknown parameter|coauthors=
ignored (|author=
suggested) (help)CS1 maint: extra punctuation (link)