OPERATOR NORM CONSISTENT ESTIMATION OF LARGE-DIMENSIONAL SPARSE COVARIANCE MATRICES

被引:218
作者
El Karoui, Noureddine [1 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
Covariance matrices; correlation matrices; adjacency matrices; eigenvalues of covariance matrices; multivariate statistical analysis; high-dimensional inference; random matrix theory; sparsity; beta-sparsity;
D O I
10.1214/07-AOS559
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Estimating covariance matrices is a problem of fundamental importance in multivariate statistics. In practice it is increasingly frequent to work with data matrices X of dimension if x p, where p and n are both large. Results from random matrix theory show very clearly that in this setting, standard estimators like the sample covariance matrix perform in general very poorly. In this "large n, large p" setting, it is sometimes the case that practitioners are willing to assume that many elements of the population covariance matrix are equal to 0, and hence this matrix is sparse. We develop an estimator to handle this situation. The estimator is shown to be consistent in operator norm, when, for instance, we have p asymptotic to n as n -> infinity. In other words the largest singular value of the difference between the estimator and the population covariance matrix goes to zero. This implies consistency of all the eigenvalues and consistency of eigenspaces associated to isolated eigenvalues. We also propose a notion of sparsity for matrices, that is, "compatible" with spectral analysis and is independent of the ordering of the variables.
引用
收藏
页码:2717 / 2756
页数:40
相关论文
共 25 条
[1]   A CLT for a band matrix model [J].
Anderson, GW ;
Zeitouni, O .
PROBABILITY THEORY AND RELATED FIELDS, 2006, 134 (02) :283-338
[2]  
Anderson TW., 2003, INTRO MULTIVARIATE S
[3]  
Bai ZD, 2004, ANN PROBAB, V32, P553
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]  
Bhatia R., 2013, MATRIX ANAL
[6]   Regularized estimation of large covariance matrices [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (01) :199-227
[7]   COVARIANCE REGULARIZATION BY THRESHOLDING [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (06) :2577-2604
[8]   First-order methods for sparse covariance selection [J].
D'Aspremont, Alexandre ;
Banerjee, Onureena ;
El Ghaoui, Laurent .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2008, 30 (01) :56-66
[9]   ROTATION OF EIGENVECTORS BY A PERTURBATION .3. [J].
DAVIS, C ;
KAHAN, WM .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 1970, 7 (01) :1-&
[10]   Tracy-Widom limit for the largest eigenvalue of a large class of complex sample covariance matrices [J].
El Karoui, Noureddine .
ANNALS OF PROBABILITY, 2007, 35 (02) :663-714