Regularized linear discriminant analysis and its application in microarrays

被引:399
作者
Guo, Yaqian [1 ]
Hastie, Trevor
Tibshirani, Robert
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
关键词
classification; discriminant analysis; microarray; prediction analysis of microarrays (PAM); regularization; shrunken centriods;
D O I
10.1093/biostatistics/kxj035
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we introduce a modified version of linear discriminant analysis, called the "shrunken centroids regularized discriminant analysis" (SCRDA). This method generalizes the idea of the "nearest shrunken centroids" (NSC) (Tibshirani and others, 2003) into the classical discriminant analysis. The SCRDA method is specially designed for classification problems in high dimension low sample size situations, for example, microarray data. Through both simulated data and real life data, it is shown that this method performs very well in multivariate classification problems, often outperforms the PAM method (using the NSC algorithm) and can be as competitive as the support vector machines classifiers. It is also suitable for feature elimination purpose and can be used as gene selection method. The open source R package for this method (named "rda") is available on CRAN (http://www.r-project.org) for download and testing.
引用
收藏
页码:86 / 100
页数:15
相关论文
共 12 条
[1]   FURTHER APPLICATIONS OF BIAS TO DISCRIMINANT-ANALYSIS [J].
DIPILLO, PJ .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1977, 6 (10) :933-943
[2]   APPLICATION OF BIAS TO DISCRIMINANT-ANALYSIS [J].
DIPILLO, PJ .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1976, 5 (09) :843-854
[3]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[4]  
Friedman J, 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
[5]   REGULARIZED DISCRIMINANT-ANALYSIS [J].
FRIEDMAN, JH .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (405) :165-175
[6]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[7]   RIDGE REGRESSION - BIASED ESTIMATION FOR NONORTHOGONAL PROBLEMS [J].
HOERL, AE ;
KENNARD, RW .
TECHNOMETRICS, 1970, 12 (01) :55-&
[8]   Multiclass cancer diagnosis using tumor gene expression signatures [J].
Ramaswamy, S ;
Tamayo, P ;
Rifkin, R ;
Mukherjee, S ;
Yeang, CH ;
Angelo, M ;
Ladd, C ;
Reich, M ;
Latulippe, E ;
Mesirov, JP ;
Poggio, T ;
Gerald, W ;
Loda, M ;
Lander, ES ;
Golub, TR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (26) :15149-15154
[10]   Class prediction by nearest shrunken centroids, with applications to DNA microarrays [J].
Tibshirani, R ;
Hastie, T ;
Narasimhan, B ;
Chu, G .
STATISTICAL SCIENCE, 2003, 18 (01) :104-117