A new approach to Cholesky-based covariance regularization in high dimensions

被引:105
作者
Rothman, Adam J. [1 ]
Levina, Elizaveta [1 ]
Zhu, Ji [1 ]
机构
[1] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Cholesky decomposition; High-dimensional data; Large p small n; Lasso; Sparsity; LONGITUDINAL DATA; MATRIX ESTIMATION; SELECTION; MODELS; LASSO;
D O I
10.1093/biomet/asq022
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper we propose a new regression interpretation of the Cholesky factor of the covariance matrix, as opposed to the well-known regression interpretation of the Cholesky factor of the inverse covariance, which leads to a new class of regularized covariance estimators suitable for high-dimensional problems. Regularizing the Cholesky factor of the covariance via this regression interpretation always results in a positive definite estimator. In particular, one can obtain a positive definite banded estimator of the covariance matrix at the same computational cost as the popular banded estimator of Bickel & Levina (2008b), which is not guaranteed to be positive definite. We also establish theoretical connections between banding Cholesky factors of the covariance matrix and its inverse and constrained maximum likelihood estimation under the banding constraint, and compare the numerical performance of several methods in simulations and on a sonar data example.
引用
收藏
页码:539 / 550
页数:12
相关论文
共 23 条
[1]  
[Anonymous], 1980, Multivariate Analysis
[2]   Regularized estimation of large covariance matrices [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (01) :199-227
[3]   COVARIANCE REGULARIZATION BY THRESHOLDING [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (06) :2577-2604
[4]   Some theory for Fisher's linear discriminant function, 'naive Bayes', and some alternatives when there are many more variables than observations [J].
Bickel, PJ ;
Levina, E .
BERNOULLI, 2004, 10 (06) :989-1010
[5]   Estimation of a covariance matrix with zeros [J].
Chaudhuri, Sanjay ;
Drton, Mathias ;
Richardson, Thomas S. .
BIOMETRIKA, 2007, 94 (01) :199-216
[6]   First-order methods for sparse covariance selection [J].
D'Aspremont, Alexandre ;
Banerjee, Onureena ;
El Ghaoui, Laurent .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2008, 30 (01) :56-66
[7]   OPERATOR NORM CONSISTENT ESTIMATION OF LARGE-DIMENSIONAL SPARSE COVARIANCE MATRICES [J].
El Karoui, Noureddine .
ANNALS OF STATISTICS, 2008, 36 (06) :2717-2756
[8]   Sparse inverse covariance estimation with the graphical lasso [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Robert .
BIOSTATISTICS, 2008, 9 (03) :432-441
[9]   Estimation of high-dimensional prior and posterior covariance matrices in Kalman filter variants [J].
Furrer, Reinhard ;
Bengtsson, Thomas .
JOURNAL OF MULTIVARIATE ANALYSIS, 2007, 98 (02) :227-255
[10]   Covariance matrix selection and estimation via penalised normal likelihood [J].
Huang, JZ ;
Liu, NP ;
Pourahmadi, M ;
Liu, LX .
BIOMETRIKA, 2006, 93 (01) :85-98