The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based on complete data. This paper studies the optimal estimation of high-dimensional covariance matrices based on missing and noisy sample under the norm. First, the model with sub-Gaussian additive noise is presented. The generalized sample covariance is then modified to define a hard thresholding estimator , and the minimax upper bound is derived. After that, the minimax lower bound is derived, and it is concluded that the estimator presented in this article is rate-optimal. Finally, numerical simulation analysis is performed. The result shows that for missing samples with sub-Gaussian noise, if the true covariance matrix is sparse, the hard thresholding estimator outperforms the traditional estimate method.
References
[1]
Bickel, P.J. and Levina, E. (2008) Covariance Regularization by Thresholding. The Annals of Statistics, 36, 2577-2604. https://doi.org/10.1214/08-AOS600
[2]
Cai, T.T. and Zhou, H.H. (2012) Optimal Rates of Convergence for Sparse Covariance Matrix Estimation. The Annals of Statistics, 40, 2389-2420. https://doi.org/10.1214/12-AOS998
[3]
Cai, T.T. and Zhou, H.H. (2012) Minimax Estimation of Large Covariance Matrices under L1-Norm. Statistica Sinica, 22, 1319-1349. https://doi.org/10.5705/ss.2010.253
[4]
Fan, J. and Li, R. (2001) Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties. Journal of the American Statistical Association, 96, 1348-1360. http://www.jstor.org/stable/3085904 https://doi.org/10.1198/016214501753382273
[5]
Zou, H. (2006) The Adaptive Lasso and Its Oracle Properties. Journal of the American Statistical Association, 101, 1418-1429. https://doi.org/10.1198/016214506000000735
[6]
Rothman, A.J., Levina, E. and Zhu, J. (2009) Generalized Thresholding of Large Covariance Matrices. Journal of the American Statistical Association, 104, 177-186. https://doi.org/10.1198/jasa.2009.0101
[7]
Cai, T.T. and Liu, W. (2011) Adaptive Thresholding for Sparse Covariance Matrix Estimation. Journal of the American Statistical Association, 106, 672-684. http://www.jstor.org/stable/41416401 https://doi.org/10.1198/jasa.2011.tm10560
[8]
Cai, T.T., Liu, W. and Zhou, H.H. (2016) Estimating Sparse Precision Matrix: Optimal Rates of Convergence and Adaptive Estimation. The Annals of Statistics, 44, 455-488. https://doi.org/10.1214/13-AOS1171
[9]
Bickel, P.J. and Levina, E. (2008) Regularized Estimation of Large Covariance Matrices. The Annals of Statistics, 36, 199-227. https://doi.org/10.1214/009053607000000758
[10]
Cai, T.T., Zhang, C.H. and Zhou, H.H. (2010) Optimal Rates of Convergence for Covariance Matrix Estimation. The Annals of Statistics, 38, 2118-2144. https://doi.org/10.1214/09-AOS752
[11]
Cai, T.T. and Zhang, A. (2016) Minimax Rate-Optimal Estimation of High-Dimensional Covariance Matrices with Incomplete Data. Journal of Multivariate Analysis, 150, 55-74. https://doi.org/10.1016/j.jmva.2016.05.002
[12]
Qi, X. (2022) Low Rank Matrix Perturbation Analysis and Estimation for Two Classes of Sparse Covariance Matrices. Ph.D. Thesis, Beijing University, Beijing.
[13]
Shi, W. (2022) Optimal Estimation of Bandable Covariance Matrices Based on Noised Consored Data. MSc. Thesis, Beijing University, Beijing.
[14]
Tsybakov, A.B. (2009) Introduction to Nonparametric Estimation. Springer-Verlag, New York. https://doi.org/10.1007/b13794