Functional regression

This is an old revision of this page, as edited by Ms.chen (talk | contribs) at 20:50, 12 March 2017 (Concurrent models). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Functional regression is a version of regression analysis in which the responses or covariates include functional data. Functional regression models can be classified into four types depending on whether the responses or covariates are functional or scalar: (i) scalar responses with functional covariates, (ii) functional responses with scalar covariates, (iii) functional responses with functional covariates, and (iv) scalar or functional responses with functional and scalar covariates. Independently of this classification, functional regression models can be linear, partially linear, or nonlinear. In particular, functional polynomial models, functional single and multiple index models, and functional additive models are three special cases of functional nonlinear models.

Functional linear models (FLMs)

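Functional linear models (FLMs) are an extension of linear models (LMs). A linear model with scalar response $Y \in \mathbb{R}$ and scalar covariates $X \in \mathbb{R}^p$ can be written as

$$Y = \beta_0 + \langle X, \beta \rangle + \varepsilon, \tag{1}$$

where $\langle \cdot, \cdot \rangle$ denotes the inner product in Euclidean space, $\beta_0 \in \mathbb{R}$ and $\beta \in \mathbb{R}^p$ denote the regression coefficients, and $\varepsilon$ is a random error with mean zero and finite variance. FLMs can be divided into two types based on the responses.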

Functional linear models with scalar responses

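Functional linear models with scalar responses can be obtained by replacing the scalar covariates $X$ and the coefficient vector $\beta$ in model (1) by a centered functional covariate $X^c(\cdot) = X(\cdot) - \mathbb{E}(X(\cdot))$ and a coefficient function $\beta = \beta(\cdot)$ with domain $\mathcal{T}$, respectively, and replacing the inner product in Euclidean space by that in $L^2$ space,

$$Y = \beta_0 + \langle X^c, \beta \rangle + \varepsilon = \beta_0 + \int_{\mathcal{T}} X^c(t)\beta(t)\,dt + \varepsilon, \tag{2}$$

where $\langle \cdot, \cdot \rangle$ here denotes the inner product in $L^2$ space. One approach to estimating $\beta_0$ and $\beta(\cdot)$ is to expand the centered covariate $X^c(\cdot)$ and the coefficient function $\beta(\cdot)$ in the same functional basis, for example, a B-spline basis or the eigenbasis used in the Karhunen–Loève expansion. Suppose $\{\phi_k\}_{k=1}^\infty$ is an orthonormal basis of $L^2$ space. Expanding $X^c$ and $\beta$ in this basis, $X^c(\cdot) = \sum_{k=1}^\infty x_k \phi_k(\cdot)$, $\beta(\cdot) = \sum_{k=1}^\infty \beta_k \phi_k(\cdot)$, model (2) becomes

$$Y = \beta_0 + \sum_{k=1}^\infty \beta_k x_k + \varepsilon.$$

In implementation, regularization is needed and can be done through truncation, $L^2$ penalization or $L^1$ penalization[1]. In addition, a reproducing kernel Hilbert space (RKHS) approach can also be used to estimate $\beta_0$ and $\beta(\cdot)$ in model (2)[2].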
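
The basis-expansion approach with truncation can be sketched in a few lines of NumPy. This is a minimal illustration on simulated data, not the estimator of any cited paper: the cosine basis, the truncation level `K`, the helper names (`trapezoid_weights`, `fit_scalar_flm`) and all simulation constants are assumptions made for the example. The curves are assumed to be observed without error on a common dense grid.

```python
import numpy as np

def trapezoid_weights(t):
    """Quadrature weights w such that sum(w * f(t)) approximates the integral of f."""
    w = np.zeros_like(t, dtype=float)
    dt = np.diff(t)
    w[:-1] += dt / 2
    w[1:] += dt / 2
    return w

def fit_scalar_flm(X, y, t, basis):
    """Least-squares fit of model (2) after truncating to the supplied basis.

    X: (n, m) curves on grid t; y: (n,) scalar responses;
    basis: (K, m) orthonormal functions phi_1..phi_K evaluated on t.
    """
    w = trapezoid_weights(t)
    Xc = X - X.mean(axis=0)                  # centre with the sample mean curve
    scores = (Xc * w) @ basis.T              # x_ik = integral of X_i^c(t) phi_k(t) dt
    design = np.column_stack([np.ones(len(y)), scores])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    beta0, beta_k = coef[0], coef[1:]
    beta_curve = beta_k @ basis              # beta(t) = sum_k beta_k phi_k(t)
    return beta0, beta_k, beta_curve

# Simulated data; every constant below is made up for illustration.
rng = np.random.default_rng(0)
n, m, K = 500, 101, 4
t = np.linspace(0.0, 1.0, m)
basis = np.array([np.sqrt(2) * np.cos(k * np.pi * t) for k in range(1, K + 1)])
xi = rng.normal(size=(n, K)) / np.arange(1, K + 1)    # scores with decaying variance
X = 2.0 + xi @ basis                                  # mean curve is the constant 2
true_beta_k = np.array([1.0, -0.5, 0.0, 0.0])
y = 1.5 + xi @ true_beta_k + 0.05 * rng.normal(size=n)

beta0_hat, beta_k_hat, beta_curve_hat = fit_scalar_flm(X, y, t, basis)
```

Truncation enters only through the number of rows in `basis`. In practice `K` would be chosen from the data, for example by cross-validation, and a penalized fit would replace the plain least-squares step.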

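Adding multiple functional and scalar covariates, model (2) can be extended to

$$Y = \sum_{k=1}^q Z_k \alpha_k + \sum_{j=1}^p \int_{\mathcal{T}_j} X_j^c(t)\beta_j(t)\,dt + \varepsilon, \tag{3}$$

where $Z_1, \ldots, Z_q$ are scalar covariates with $Z_1 = 1$, $\alpha_1, \ldots, \alpha_q$ are the coefficients corresponding to $Z_1, \ldots, Z_q$, respectively, $X_j^c$ is a centered functional covariate given by $X_j^c(\cdot) = X_j(\cdot) - \mathbb{E}(X_j(\cdot))$, $\beta_j$ is the corresponding coefficient function, and $\mathcal{T}_j$ is the domain of $X_j$ and $\beta_j$, for $j = 1, \ldots, p$. However, due to the parametric component $\alpha$, the estimation methods for model (2) cannot be applied directly in this case[3]; alternative estimation methods for model (3) are available[4][5].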

Functional linear models with functional responses

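For a functional response $Y(\cdot)$ with domain $\mathcal{T}$ and a functional covariate $X(\cdot)$ with domain $\mathcal{S}$, two FLMs regressing $Y(\cdot)$ on $X(\cdot)$ have been considered[3][6]. One of these two models is of the form

$$Y(t) = \beta_0(t) + \int_{\mathcal{S}} \beta(s,t) X^c(s)\,ds + \varepsilon(t), \quad \text{for } t \in \mathcal{T}, \tag{4}$$

where $X^c(\cdot) = X(\cdot) - \mathbb{E}(X(\cdot))$ is still the centered functional covariate, $\beta_0(\cdot)$ and $\beta(\cdot,\cdot)$ are coefficient functions, and $\varepsilon(\cdot)$ is usually assumed to be a random process with mean zero and finite variance. In this case, at any given time $t \in \mathcal{T}$, the value of $Y$, i.e. $Y(t)$, depends on the entire trajectory of $X$. Model (4), for any given time $t$, is an extension of multivariate linear regression with the inner product in Euclidean space replaced by that in $L^2$ space. An estimating equation motivated by multivariate linear regression is

$$r_{XY} = R_{XX}\beta, \quad \text{for } \beta \in L^2(\mathcal{S} \times \mathcal{T}),$$

where $r_{XY}(s,t) = \operatorname{cov}(X(s), Y(t))$, and $R_{XX}: L^2(\mathcal{S} \times \mathcal{T}) \rightarrow L^2(\mathcal{S} \times \mathcal{T})$ is defined as $(R_{XX}\beta)(s,t) = \int_{\mathcal{S}} r_{XX}(s,w)\beta(w,t)\,dw$ with $r_{XX}(s,w) = \operatorname{cov}(X(s), X(w))$ for $s, w \in \mathcal{S}$[3]. Regularization is needed and can be done through truncation, $L^2$ penalization or $L^1$ penalization[1]. Various estimation methods for model (4) are also available[7][8].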
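
One way to regularize the estimating equation by truncation is to solve it in the leading eigenfunctions of the covariance of the covariate, which gives the surface $\sum_{k \le K} \lambda_k^{-1} \operatorname{cov}(\xi_k, Y(t))\,\phi_k(s)$, where $\xi_k$ is the score of the centered covariate on the eigenfunction $\phi_k$ with eigenvalue $\lambda_k$. The NumPy sketch below is a hedged illustration on simulated, densely observed, noise-free covariate curves. The function names, the truncation level `K` and every simulation constant are assumptions for the example and do not come from the cited references.

```python
import numpy as np

def trapezoid_weights(t):
    """Quadrature weights w such that sum(w * f(t)) approximates the integral of f."""
    w = np.zeros_like(t, dtype=float)
    dt = np.diff(t)
    w[:-1] += dt / 2
    w[1:] += dt / 2
    return w

def fit_function_on_function(X, Y, s, K):
    """Truncated eigenbasis solution of r_XY = R_XX beta.

    X: (n, len(s)) covariate curves; Y: (n, len(t)) response curves.
    Returns beta0 on the t grid and beta on the (s, t) grid.
    """
    n = X.shape[0]
    w = trapezoid_weights(s)
    sw = np.sqrt(w)
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    # Symmetrised, quadrature-weighted sample covariance of X.
    Xw = Xc * sw
    evals, evecs = np.linalg.eigh(Xw.T @ Xw / n)
    top = np.argsort(evals)[::-1][:K]
    lam = evals[top]                          # leading eigenvalues lambda_k
    phi = evecs[:, top] / sw[:, None]         # eigenfunctions, unit L2 norm
    scores = (Xc * w) @ phi                   # xi_ik = integral of X_i^c phi_k
    r_score_y = scores.T @ Yc / n             # cov(xi_k, Y(t)), shape (K, len(t))
    beta = phi @ (r_score_y / lam[:, None])   # sum_k phi_k(s) cov(xi_k, Y(t)) / lambda_k
    beta0 = Y.mean(axis=0)                    # E[Y(t)], since X^c has mean zero
    return beta0, beta

# Simulated data; every constant below is made up for illustration.
rng = np.random.default_rng(1)
n = 400
s = np.linspace(0.0, 1.0, 101)
t = np.linspace(0.0, 1.0, 51)
phi_true = np.array([np.sqrt(2) * np.cos(k * np.pi * s) for k in (1, 2, 3)])
xi = rng.normal(size=(n, 3)) / np.array([1.0, 2.0, 3.0])
X = xi @ phi_true
A = np.vstack([t, 0.5 * (1.0 - t), np.zeros_like(t)])   # coefficients of beta in phi_true
beta_true = phi_true.T @ A                               # beta(s, t) on the grid
Y = np.sin(2 * np.pi * t) + xi @ A + 0.05 * rng.normal(size=(n, t.size))

beta0_hat, beta_hat = fit_function_on_function(X, Y, s, K=3)
```

Because the scores on distinct eigenfunctions are uncorrelated, this solution coincides with regressing each $Y(t)$ on the first `K` scores by least squares.
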
When $X$ and $Y$ are concurrently observed, i.e. $\mathcal{S} = \mathcal{T}$[9], it is sometimes reasonable to consider a historical functional linear model, where the current value of $Y$ only depends on the history of $X$, i.e. $\beta(s,t) = 0$ for $s > t$ in model (4)[3][10].
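Adding multiple functional covariates, model (4) can be extended to

$$Y(t) = \beta_0(t) + \sum_{j=1}^p \int_{\mathcal{S}_j} \beta_j(s,t) X_j^c(s)\,ds + \varepsilon(t), \quad \text{for } t \in \mathcal{T}, \tag{5}$$

where for $j = 1, \ldots, p$, $X_j^c(\cdot) = X_j(\cdot) - \mathbb{E}(X_j(\cdot))$ is a centered functional covariate with domain $\mathcal{S}_j$, and $\beta_j(\cdot,\cdot)$ is the corresponding coefficient function, defined on $\mathcal{S}_j \times \mathcal{T}$. In particular, taking each $X_j(\cdot)$ as a constant function yields a special case of model (5) which is an FLM with functional responses and scalar covariates.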

Concurrent models

Assuming that $\mathcal{S} = \mathcal{T}$, another model has been proposed, known as the concurrent model and sometimes also referred to as the varying-coefficient model, which is easier to handle than the historical functional linear model. A concurrent model is of the form

$$Y(t) = \alpha_0(t) + \alpha(t) X(t) + \varepsilon(t), \quad \text{for } t \in \mathcal{T}, \tag{6}$$

where $\alpha_0$ and $\alpha$ are coefficient functions. Note that model (6) assumes the value of $Y$ at time $t$, i.e. $Y(t)$, only depends on that of $X$ at the same time, i.e. $X(t)$. Various estimation methods can be applied to model (6)[11][12].
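Adding multiple functional covariates, model (6) can also be extended to

$$Y(t) = \alpha_0(t) + \sum_{j=1}^p \alpha_j(t) X_j(t) + \varepsilon(t), \quad \text{for } t \in \mathcal{T},$$

where $X_1, \ldots, X_p$ are multiple functional covariates with domain $\mathcal{T}$ and $\alpha_0, \alpha_1, \ldots, \alpha_p$ are the coefficient functions with the same domain.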
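
Because model (6) links the response and the covariate only at the same time point, a simple baseline for curves observed on a common grid is ordinary least squares carried out separately at each grid point. The sketch below shows that baseline for a single covariate. It is an assumption-laden illustration (the function name and simulated data are invented here) and omits the smoothing across time used by the kernel and spline estimators in the cited literature.

```python
import numpy as np

def fit_concurrent(X, Y):
    """Pointwise OLS for Y(t) = alpha0(t) + alpha(t) X(t) + eps(t).

    X, Y: (n, m) arrays of curves observed on the same grid of m time points.
    Returns alpha0 and alpha evaluated on that grid.
    """
    X_bar, Y_bar = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - X_bar, Y - Y_bar
    alpha = (Xc * Yc).sum(axis=0) / (Xc ** 2).sum(axis=0)   # slope at each t
    alpha0 = Y_bar - alpha * X_bar                          # intercept at each t
    return alpha0, alpha

# Simulated data; every constant below is made up for illustration.
rng = np.random.default_rng(2)
n, m = 200, 50
t = np.linspace(0.0, 1.0, m)
a = rng.normal(size=(n, 1))
b = rng.normal(size=(n, 1))
X = 1.0 + a + b * t                       # random straight lines as covariate curves
alpha0_true = np.sin(2 * np.pi * t)
alpha_true = 1.0 + t ** 2
Y = alpha0_true + alpha_true * X + 0.1 * rng.normal(size=(n, m))

alpha0_hat, alpha_hat = fit_concurrent(X, Y)
```

With several covariates, the same idea applies with a multiple regression at each grid point. The pointwise estimates are typically smoothed over $t$ afterwards, since neighbouring time points share information.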

Functional nonlinear models

Functional polynomial models

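Functional polynomial models are an extension of the FLMs, analogous to extending linear regression to polynomial regression. For a scalar response $Y$ and a functional covariate $X(\cdot)$ with domain $\mathcal{T}$, the simplest example of functional polynomial models is functional quadratic regression[13]

$$Y = \alpha + \int_{\mathcal{T}} \beta(t) X^c(t)\,dt + \int_{\mathcal{T}} \int_{\mathcal{T}} \gamma(s,t) X^c(s) X^c(t)\,ds\,dt + \varepsilon,$$

where $X^c(\cdot) = X(\cdot) - \mathbb{E}(X(\cdot))$ is the centered functional covariate, $\alpha$ is a scalar coefficient, $\beta(\cdot)$ and $\gamma(\cdot,\cdot)$ are coefficient functions with domains $\mathcal{T}$ and $\mathcal{T} \times \mathcal{T}$, respectively, and $\varepsilon$ is a random error with mean zero and finite variance. By analogy to FLMs, estimation of functional polynomial models can be obtained through expanding both the centered covariate $X^c$ and the coefficient functions $\beta$ and $\gamma$ in an orthonormal basis[13].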
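
After expanding in an orthonormal basis and truncating at $K$ terms, functional quadratic regression reduces to an ordinary quadratic regression in the scores $x_1, \ldots, x_K$ of the centered covariate. The following NumPy sketch illustrates this reduction under assumptions chosen for the example (a fixed cosine basis, `K = 2`, simulated noise-free curves, invented function names); it is not the estimator of the cited paper.

```python
import numpy as np
from itertools import combinations_with_replacement

def trapezoid_weights(t):
    """Quadrature weights w such that sum(w * f(t)) approximates the integral of f."""
    w = np.zeros_like(t, dtype=float)
    dt = np.diff(t)
    w[:-1] += dt / 2
    w[1:] += dt / 2
    return w

def fit_functional_quadratic(X, y, t, basis):
    """Quadratic regression of y on the basis scores of the centred curves.

    Returns the intercept, the linear coefficients beta_k, and a dict mapping
    (j, k) with j <= k to the fitted coefficient of x_j * x_k. For j < k that
    coefficient corresponds to gamma_jk + gamma_kj in the expansion of gamma.
    """
    w = trapezoid_weights(t)
    Xc = X - X.mean(axis=0)
    scores = (Xc * w) @ basis.T                          # (n, K)
    K = scores.shape[1]
    pairs = list(combinations_with_replacement(range(K), 2))
    quad = np.column_stack([scores[:, j] * scores[:, k] for j, k in pairs])
    design = np.column_stack([np.ones(len(y)), scores, quad])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    alpha = coef[0]
    beta_k = coef[1:1 + K]
    gamma_jk = dict(zip(pairs, coef[1 + K:]))
    return alpha, beta_k, gamma_jk

# Simulated data; every constant below is made up for illustration.
rng = np.random.default_rng(3)
n, m, K = 500, 101, 2
t = np.linspace(0.0, 1.0, m)
basis = np.array([np.sqrt(2) * np.cos(k * np.pi * t) for k in range(1, K + 1)])
xi = rng.normal(size=(n, K)) * np.array([1.0, 0.5])
X = xi @ basis
y = (1.0 + 1.0 * xi[:, 0] + 0.5 * xi[:, 1]
     + 0.5 * xi[:, 0] ** 2 - 0.3 * xi[:, 0] * xi[:, 1]
     + 0.05 * rng.normal(size=n))

alpha_hat, beta_k_hat, gamma_hat = fit_functional_quadratic(X, y, t, basis)
```

The number of quadratic terms grows like $K^2/2$, so the truncation level matters more here than in the linear model.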

Functional single and multiple index models

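A functional multiple index model is given by

$$Y = g\left(\int_{\mathcal{T}} X^c(t)\beta_1(t)\,dt, \ldots, \int_{\mathcal{T}} X^c(t)\beta_p(t)\,dt\right) + \varepsilon.$$

Taking $p = 1$ yields a functional single index model. However, for $p > 1$ this model is problematic due to the curse of dimensionality. In other words, with $p > 1$ and relatively small sample sizes, this model often leads to high variability of the estimator[14]. Alternatively, a preferable $p$-component functional multiple index model can be formed as

$$Y = g_1\left(\int_{\mathcal{T}} X^c(t)\beta_1(t)\,dt\right) + \cdots + g_p\left(\int_{\mathcal{T}} X^c(t)\beta_p(t)\,dt\right) + \varepsilon.$$

Various estimation methods for functional single and multiple index models are available[14][15].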

Functional additive models (FAMs)

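Given an expansion of a functional covariate $X$ with domain $\mathcal{T}$, after centering, in an orthonormal basis $\{\phi_k\}_{k=1}^\infty$: $X^c(t) = \sum_{k=1}^\infty x_k \phi_k(t)$, a functional linear model with scalar responses shown in model (2) can be written as

$$\mathbb{E}(Y \mid X) = \mathbb{E}(Y) + \sum_{k=1}^\infty \beta_k x_k.$$

One form of FAMs is obtained by replacing the linear function of $x_k$, i.e. $\beta_k x_k$, by a general smooth function $f_k$,

$$\mathbb{E}(Y \mid X) = \mathbb{E}(Y) + \sum_{k=1}^\infty f_k(x_k), \tag{7}$$

where $f_k$ satisfies $\mathbb{E}(f_k(x_k)) = 0$ for $k \in \mathbb{N}$[3]. Apart from model (7), another form of FAMs consists of a sequence of time-additive models:

$$\mathbb{E}(Y \mid X(t_1), \ldots, X(t_p)) = \sum_{j=1}^p f_j(X(t_j)), \tag{8}$$

where $\{t_1, \ldots, t_p\}$ is a dense grid on $\mathcal{T}$ with increasing size $p \in \mathbb{N}$, and $f_j(x) = g(t_j, x)$ with $g$ a smooth function, for $j = 1, \ldots, p$[3]. Estimation methods are available for both model (7) and model (8)[16][17].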

Extensions

A direct extension of FLMs with scalar responses shown in model (2) is to add a link function to create a generalized functional linear model (GFLM), by analogy to extending linear regression to generalized linear models (GLMs). Its three components are:

  1. Linear predictor $\eta = \beta_0 + \int_{\mathcal{T}} X^c(t)\beta(t)\,dt$;
  2. Variance function $\operatorname{Var}(Y \mid X) = V(\mu)$, where $\mu = \mathbb{E}(Y \mid X)$ is the conditional mean;
  3. Link function $g$ connecting the conditional mean $\mu$ and the linear predictor $\eta$ through $\mu = g(\eta)$.
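
As a hedged illustration of the three components, the sketch below fits a GFLM for a binary response, taking $g$ to be the logistic function and $V(\mu) = \mu(1 - \mu)$. The linear predictor is truncated to a few basis scores and the coefficients are found by Newton iterations (iteratively reweighted least squares). The basis, the truncation level, the iteration count, the function names and the simulated data are all assumptions for the example.

```python
import numpy as np

def trapezoid_weights(t):
    """Quadrature weights w such that sum(w * f(t)) approximates the integral of f."""
    w = np.zeros_like(t, dtype=float)
    dt = np.diff(t)
    w[:-1] += dt / 2
    w[1:] += dt / 2
    return w

def fit_logistic_gflm(X, y, t, basis, n_iter=25):
    """Newton (IRLS) fit of a GFLM with logistic g and Bernoulli variance function.

    X: (n, m) curves on grid t; y: (n,) array of 0/1 responses;
    basis: (K, m) orthonormal functions evaluated on t.
    """
    w = trapezoid_weights(t)
    Xc = X - X.mean(axis=0)
    scores = (Xc * w) @ basis.T
    D = np.column_stack([np.ones(len(y)), scores])
    coef = np.zeros(D.shape[1])
    for _ in range(n_iter):
        eta = D @ coef                       # linear predictor
        mu = 1.0 / (1.0 + np.exp(-eta))      # conditional mean, mu = g(eta)
        V = mu * (1.0 - mu)                  # variance function V(mu)
        grad = D.T @ (y - mu)                # score of the log-likelihood
        hess = D.T @ (D * V[:, None])        # Fisher information
        coef = coef + np.linalg.solve(hess, grad)
    beta0, beta_k = coef[0], coef[1:]
    return beta0, beta_k, beta_k @ basis

# Simulated data; every constant below is made up for illustration.
rng = np.random.default_rng(4)
n, m, K = 4000, 101, 2
t = np.linspace(0.0, 1.0, m)
basis = np.array([np.sqrt(2) * np.cos(k * np.pi * t) for k in range(1, K + 1)])
xi = rng.normal(size=(n, K))
X = xi @ basis
true_beta_k = np.array([1.5, -1.0])
eta_true = 0.3 + xi @ true_beta_k
y = (rng.random(n) < 1.0 / (1.0 + np.exp(-eta_true))).astype(float)

beta0_hat, beta_k_hat, beta_curve_hat = fit_logistic_gflm(X, y, t, basis)
```

Other response types follow the same pattern with a different pair of $g$ and $V$, for example the exponential function and $V(\mu) = \mu$ for counts.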

References

  1. ^ a b Morris (2015). "Functional regression". Annual Review of Statistics and Its Application. 2:321–359. doi:10.1146/annurev-statistics-010814-020413.
  2. ^ Yuan and Cai (2010). "A reproducing kernel Hilbert space approach to functional linear regression". The Annals of Statistics. 38 (6):3412–3444. doi:10.1214/09-AOS772.
  3. ^ a b c d e f Wang, Chiou and Müller (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3:257–295. doi:10.1146/annurev-statistics-041715-033624
  4. ^ Kong, Xue, Yao and Zhang (2016). "Partially functional linear regression in high dimensions". Biometrika. 103 (1):147–159. doi:10.1093/biomet/asv062
  5. ^ Hu, Wang and Carroll (2004). "Profile-kernel versus backfitting in the partially linear models for longitudinal/clustered data". Biometrika. 91 (2): 251–262. doi:10.1093/biomet/91.2.251
  6. ^ Ramsay and Silverman (2005). Functional data analysis, 2nd ed., New York : Springer, ISBN 0-387-40080-X
  7. ^ Ramsay and Dalzell (1991). "Some tools for functional data analysis". Journal of the Royal Statistical Society. Series B (Methodological). 53 (3):539–572. http://www.jstor.org/stable/2345586.
  8. ^ Yao, Müller and Wang (2005). "Functional linear regression analysis for longitudinal data". The Annals of Statistics. 33 (6):2873–2903. doi:10.1214/009053605000000660
  9. ^ Grenander (1950). "Stochastic processes and statistical inference". Arkiv för Matematik. 1 (3):195–277. doi:10.1007/BF02590638.
  10. ^ Malfait and Ramsay (2003). "The historical functional linear model". Canadian Journal of Statistics. 31 (2):115–128. doi:10.2307/3316063.
  11. ^ Fan and Zhang (1999). "Statistical estimation in varying coefficient models". The Annals of Statistics. 27 (5):1491–1518. doi:10.1214/aos/1017939139.
  12. ^ Huang, Wu and Zhou (2004). "Polynomial spline estimation and inference for varying coefficient models with longitudinal data". Statistica Sinica. 14 (3):763–788. http://www.jstor.org/stable/24307415.
  13. ^ a b Yao and Müller (2010). "Functional quadratic regression". Biometrika. 97 (1):49–64. doi:10.1093/biomet/asp069
  14. ^ a b Chen, Hall and Müller (2011). "Single and multiple index functional regression models with nonparametric link". The Annals of Statistics. 39 (3):1720–1747. doi:10.1214/11-AOS882
  15. ^ Jiang and Wang (2011). "Functional single index models for longitudinal data". The Annals of Statistics. 39 (1):362–388. doi:10.1214/10-AOS845
  16. ^ Müller and Yao (2008). "Functional additive models". Journal of the American Statistical Association. 103 (484):1534–1544. doi:10.1198/016214508000000751
  17. ^ Fan, James and Radchenko (2015). "Functional additive regression". The Annals of Statistics. 43 (5):2296–2325. doi:10.1214/15-AOS1346