Significance testing for small microarray experiments

被引:43
作者
Kooperberg, C [1 ]
Aragaki, A [1 ]
Strand, AD [1 ]
Olson, JM [1 ]
机构
[1] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98109 USA
关键词
false positives; type I error; empirical Bayes;
D O I
10.1002/sim.2109
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Which significance test is carried out when the number of repeats is small in microarray experiments can dramatically influence the results. When in two sample comparisons both conditions have fewer than, say, five repeats traditional test statistics require extreme results, before a gene is considered statistically significant differentially expressed after a multiple comparisons correction. In the literature many approaches to circumvent this problem have been proposed. Some of these proposals use (empirical) Bayes arguments to moderate the variance estimates for individual genes. Other proposals try to stabilize these variance estimate by combining groups of genes or similar experiments. In this paper we compare several of these approaches, both on data sets where both experimental conditions are the same, and thus few statistically significant differentially expressed genes should be identified, and on experiments where both conditions do differ. This allows us to identify which approaches are most powerful without identifying many false positives. We conclude that after balancing the numbers of false positives and true positives an empirical Bayes approach and an approach which combines experiments perform best. Standard t-tests are inferior and offer almost no power when the sample size is small. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:2281 / 2298
页数:18
相关论文
共 18 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]   Increased huntingtin protein length reduces the number of polyglutamine-induced gene expression changes in mouse models of Huntington's disease [J].
Chan, EYW ;
Luthi-Carter, R ;
Strand, A ;
Solano, SM ;
Hanson, SA ;
DeJohn, MM ;
Kooperberg, C ;
Chase, KO ;
DiFiglia, M ;
Young, AB ;
Leavitt, BR ;
Cha, JHJ ;
Aronin, N ;
Hayden, MR ;
Olson, JM .
HUMAN MOLECULAR GENETICS, 2002, 11 (17) :1939-1951
[4]  
CUI X, IMPROVED STAT TESTS
[5]   Statistical tests for differential expression in cDNA microarray experiments [J].
Cui, XQ ;
Churchill, GA .
GENOME BIOLOGY, 2003, 4 (04)
[6]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103
[7]   Comparing three methods for variance estimation with duplicated high density oligonucleotide arrays [J].
Huang X. ;
Pan W. .
Functional & Integrative Genomics, 2002, 2 (3) :126-133
[8]  
Ihaka R., 1996, J COMPUTATIONAL GRAP, V5, P299, DOI [10.1080/10618600.1996.10474713, 10.2307/1390807, DOI 10.1080/10618600.1996.10474713]
[9]   Exploration, normalization, and summaries of high density oligonucleotide array probe level data [J].
Irizarry, RA ;
Hobbs, B ;
Collin, F ;
Beazer-Barclay, YD ;
Antonellis, KJ ;
Scherf, U ;
Speed, TP .
BIOSTATISTICS, 2003, 4 (02) :249-264
[10]   Local-pooled-error test for identifying differentially expressed genes with a small number of replicated microarrays [J].
Jain, N ;
Thatte, J ;
Braciale, T ;
Ley, K ;
O'Connell, M ;
Lee, JK .
BIOINFORMATICS, 2003, 19 (15) :1945-1951