Function-on-scalar regression is a type of function response regression used to analyze the relationship between function response and a set of scalar predictor factors. The variable selection methods of FOSR models mostly focus on the linear effects of scalar predictor factors. Therefore, in this paper, we perform robust variable selection for nonlinear FOSR models with the presence of multiple continuous covariates to further explain the behavior of function response over time. We project functional data into a low dimensional principal component space via the principal component score of the function response, in order to use the principal component score for variable selection and regression modeling. In this paper, we develop a regularized iterative algorithm based on exponential squared loss and group smoothly clipped absolute for predicting estimates of scalar factors and function coefficients, and use tuning parameters chosen by data-driven methods to achieve robustness in variable selection. The robustness of the proposed method is verified through simulation studies and demonstrated on real datasets.
References
[1]
Barber, R.F., Reimherr, M. and Schill, T. (2017) The Function-on-Scalar LASSO with Applications to Longitudinal GWAS. Electronic Journal of Statistics, 11, 1351-1389. https://doi.org/10.1214/17-ejs1260
[2]
Chen, Y., Goldsmith, J. and Ogden, R.T. (2016) Variable Selection in Function-on-Scalar Regression. Stat, 5, 88-101. https://doi.org/10.1002/sta4.106
[3]
Goldsmith, J., Liu, X., Jacobson, J.S. And Rundle, A. (2016) New Insights into Activity Patterns in Children, Found Using Functional Data Analyses. Medicine & Science in Sports & Exercise, 48, 1723-1729. https://doi.org/10.1249/mss.0000000000000968
[4]
Fan, Z. and Reimherr, M. (2017) High-Dimensional Adaptive Function-on-Scalar Regression. Econometrics and Statistics, 1, 167-183. https://doi.org/10.1016/j.ecosta.2016.08.001
[5]
Parodi, A. and Reimherr, M. (2018) Simultaneous Variable Selection and Smoothing for High-Dimensional Function-On-Scalar Regression. Electronic Journal of Statistics, 12, 4602-4639. https://doi.org/10.1214/18-ejs1509
[6]
Ghosal, R. and Maity, A. (2021) Variable Selection in Nonlinear Function-on-Scalar Regression. Biometrics, 79, 292-303. https://doi.org/10.1111/biom.13564
[7]
Wang, X., Jiang, Y., Huang, M. and Zhang, H. (2013) Robust Variable Selection with Exponential Squared Loss. Journal of the American Statistical Association, 108, 632-643. https://doi.org/10.1080/01621459.2013.766613
[8]
Ash, R.B. and Gardner, M.F. (1978) Topics in Stochastic Processes. Academic Press.
[9]
Yao, F., Müller, H. and Wang, J. (2005) Functional Data Analysis for Sparse Longitudinal Data. Journal of the American Statistical Association, 100, 577-590. https://doi.org/10.1198/016214504000001745
[10]
Hall, P. and Hosseini-Nasab, M. (2005) On Properties of Functional Principal Components Analysis. Journal of the Royal Statistical Society Series B: Statistical Methodology, 68, 109-126. https://doi.org/10.1111/j.1467-9868.2005.00535.x
[11]
Fan, J. and Li, R. (2001) Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties. Journal of the American Statistical Association, 96, 1348-1360. https://doi.org/10.1198/016214501753382273
[12]
Wang, L., Chen, G. and Li, H. (2007) Group SCAD Regression Analysis for Microarray Time Course Gene Expression Data. Bioinformatics, 23, 1486-1494. https://doi.org/10.1093/bioinformatics/btm125
[13]
Yao, W., Lindsay, B.G. and Li, R. (2012) Local Modal Regression. Journal of Nonparametric Statistics, 24, 647-663. https://doi.org/10.1080/10485252.2012.678848
[14]
Stone, C.J. (1982) Optimal Global Rates of Convergence for Nonparametric Regression. The Annals of Statistics, 10, 1040-1053. https://doi.org/10.1214/aos/1176345969
[15]
Zou, H. and Li, R. (2008) One-Step Sparse Estimates in Nonconcave Penalized Likelihood Models. The Annals of Statistics, 36, 1509-1533. https://doi.org/10.1214/009053607000000802
[16]
Chen, J. and Chen, Z. (2008) Extended Bayesian Information Criteria for Model Selection with Large Model Spaces. Biometrika, 95, 759-771. https://doi.org/10.1093/biomet/asn034
[17]
Smith, S.M., Jenkinson, M., Johansen-Berg, H., Rueckert, D., Nichols, T.E., Mackay, C.E., et al. (2006) Tract-Based Spatial Statistics: Voxelwise Analysis of Multi-Subject Diffusion Data. NeuroImage, 31, 1487-1505. https://doi.org/10.1016/j.neuroimage.2006.02.024
[18]
Cai, X., Xue, L., Pu, X. and Yan, X. (2020) Efficient Estimation for Varying-Coefficient Mixed Effects Models with Functional Response Data. Metrika, 84, 467-495. https://doi.org/10.1007/s00184-020-00776-0