Reduced rank stochastic regression with a sparse singular value decomposition

被引:94
作者
Chen, Kun [2 ]
Chan, Kung-Sik [1 ]
Stenseth, Nils Chr. [3 ]
机构
[1] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[2] Kansas State Univ, Manhattan, KS 66506 USA
[3] Univ Oslo, N-0316 Oslo, Norway
基金
美国国家科学基金会;
关键词
Biclustering; Microarrary gene expression data; Multivariate regression; Oracle property; Regularization; VARIABLE SELECTION; REGULARIZATION; DIMENSION; LASSO;
D O I
10.1111/j.1467-9868.2011.01002.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
070103 [概率论与数理统计]; 140311 [社会设计与社会创新];
摘要
. For a reduced rank multivariate stochastic regression model of rank r*, the regression coefficient matrix can be expressed as a sum of r* unit rank matrices each of which is proportional to the outer product of the left and right singular vectors. For improving predictive accuracy and facilitating interpretation, it is often desirable that these left and right singular vectors be sparse or enjoy some smoothness property. We propose a regularized reduced rank regression approach for solving this problem. Computation algorithms and regularization parameter selection methods are developed, and the properties of the new method are explored both theoretically and by simulation. In particular, the regularization method proposed is shown to be selection consistent and asymptotically normal and to enjoy the oracle property. We apply the proposed model to perform biclustering analysis with microarray gene expression data.
引用
收藏
页码:203 / 221
页数:19
相关论文
共 38 条
[1]
Anderson T.W., 2002, SANKHYA A, V64, P193
[2]
Anderson T. W., 2003, An Introduction to Multivariate Statistical Analysis, V3rd
[3]
ESTIMATING LINEAR RESTRICTIONS ON REGRESSION COEFFICIENTS FOR MULTIVARIATE NORMAL DISTRIBUTIONS [J].
ANDERSON, TW .
ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (03) :327-351
[4]
Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[5]
Breheny P, 2009, STAT INTERFACE, V2, P369
[6]
OPTIMAL SELECTION OF REDUCED RANK ESTIMATORS OF HIGH-DIMENSIONAL MATRICES [J].
Bunea, Florentina ;
She, Yiyuan ;
Wegkamp, Marten H. .
ANNALS OF STATISTICS, 2011, 39 (02) :1282-1309
[7]
Biclustering in data mining [J].
Busygin, Stanislav ;
Prokopyev, Oleg ;
Pardalos, Panos M. .
COMPUTERS & OPERATIONS RESEARCH, 2008, 35 (09) :2964-2987
[8]
Tests of rank in reduced rank regression models [J].
Camba-Mendez, G ;
Kapetanios, G ;
Smith, RJ ;
Weale, MR .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2003, 21 (01) :145-155
[9]
Adapting to unknown smoothness via wavelet shrinkage [J].
Donoho, DL ;
Johnstone, IM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (432) :1200-1224
[10]
Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499