Graphical methods for class prediction using dimension reduction techniques on DNA microarray data

Cited by: 37
Authors
Bura, E
Pfeiffer, RM
Affiliations
[1] George Washington Univ, Dept Stat, Washington, DC 20052 USA
[2] NCI, DCEG, Bethesda, MD 20892 USA
Funding
National Science Foundation (USA)
DOI
10.1093/bioinformatics/btg150
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: We introduce simple graphical classification and prediction tools for tumor status using gene-expression profiles. They are based on two dimension estimation techniques: sliced average variance estimation (SAVE) and sliced inverse regression (SIR). Both SAVE and SIR are used to infer the dimension of the classification problem and to obtain linear combinations of genes that contain sufficient information to predict class membership, such as tumor type. Plots of the estimated directions, together with numerical thresholds estimated from the plots, are used to predict tumor classes in cDNA microarrays, and the performance of the class predictors is assessed by cross-validation. A microarray simulation study is carried out to compare the power and predictive accuracy of the two methods.
Results: The methods are applied to cDNA microarray data on BRCA1 and BRCA2 mutation carriers as well as sporadic tumors from Hedenfalk et al. (2001). All samples are correctly classified.
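As a rough illustration of the SIR step mentioned in the abstract, the following is a minimal sketch of textbook sliced inverse regression with the class labels used as slices, followed by a midpoint threshold on the first estimated direction. It is not the authors' implementation. The function names `sir_directions` and `threshold_classifier` are hypothetical, the midpoint threshold is an assumed stand-in for the plot-based thresholds the abstract describes, and the sketch assumes more samples than predictors (for microarray data this would require reducing the number of genes first).

```python
import numpy as np


def sir_directions(X, y, n_directions=1):
    """Textbook SIR with class labels as slices (illustrative sketch).

    Assumes the sample covariance of X is invertible (n > p).
    Returns the direction matrix (p x n_directions) and the column means.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    n, p = X.shape
    mean = X.mean(axis=0)
    cov = np.cov(X, rowvar=False)

    # Inverse square root of the covariance via its eigen-decomposition.
    evals, evecs = np.linalg.eigh(cov)
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T

    # Standardize the predictors.
    Z = (X - mean) @ inv_sqrt

    # Weighted covariance of the within-class (slice) means.
    M = np.zeros((p, p))
    for label in np.unique(y):
        Zh = Z[y == label]
        m = Zh.mean(axis=0)
        M += (len(Zh) / n) * np.outer(m, m)

    # Leading eigenvectors of M, mapped back to the original scale.
    w, v = np.linalg.eigh(M)
    order = np.argsort(w)[::-1][:n_directions]
    return inv_sqrt @ v[:, order], mean


def threshold_classifier(X, y, direction, mean):
    """Assumed rule: cut at the midpoint of the two class means
    of the projected data (binary labels 0 and 1)."""
    scores = (np.asarray(X, dtype=float) - mean) @ direction
    c0 = scores[y == 0].mean()
    c1 = scores[y == 1].mean()
    cut = 0.5 * (c0 + c1)
    sign = 1.0 if c1 > c0 else -1.0
    return ((scores - cut) * sign > 0).astype(int)
```

With a binary response, the matrix of slice means has rank at most one, so SIR can recover at most one direction; this limitation is one reason a second-moment method such as SAVE is considered alongside it.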
Pages: 1252-1258
Number of pages: 7
Related papers
24 in total
[1]   Extending sliced inverse regression: the weighted chi-squared test [J].
Bura, E ;
Cook, RD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) :996-1003
[2]   Estimating the structural dimension of regressions via parametric inverse regression [J].
Bura, E ;
Cook, RD .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2001, 63 :393-410
[3]   Dimension reduction strategies for analyzing global gene expression data with a response [J].
Chiaromonte, F ;
Martinelli, J .
MATHEMATICAL BIOSCIENCES, 2002, 176 (01) :123-144
[4]  
Cook D.R., 1999, APPL REGRESSION INCL
[5]  
Cook R. D., 1998, WILEY PROB STAT
[6]   REWEIGHTING TO ACHIEVE ELLIPTICALLY CONTOURED COVARIATES IN REGRESSION [J].
COOK, RD ;
NACHTSHEIM, CJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (426) :592-599
[7]   Identifying regression outliers and mixtures graphically [J].
Cook, RD ;
Critchley, F .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) :781-794
[8]  
COOK RD, 1991, J AM STAT ASSOC, V86, P328, DOI 10.2307/2290564
[9]   Graphics for regressions with a binary response [J].
Cook, RD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (435) :983-992
[10]   Dimension reduction in binary response regression [J].
Cook, RD ;
Lee, H .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) :1187-1200