Microarrays, empirical Bayes and the two-groups model

Cited by: 264
Authors
Efron, Bradley [1]
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
Funding
National Science Foundation (USA);
Keywords
simultaneous tests; empirical null; false discovery rates;
DOI
10.1214/07-STS236
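Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code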
020208 ; 070103 ; 0714 ;
Abstract
The classic frequentist theory of hypothesis testing developed by Neyman, Pearson and Fisher has a claim to being the twentieth century's most influential piece of applied mathematics. Something new is happening in the twenty-first century: high-throughput devices, such as microarrays, routinely require simultaneous hypothesis tests for thousands of individual cases, not at all what the classical theory had in mind. In these situations empirical Bayes information begins to force itself upon frequentists and Bayesians alike. The two-groups model is a simple Bayesian construction that facilitates empirical Bayes analysis. This article concerns the interplay of Bayesian and frequentist ideas in the two-groups setting, with particular attention focused on Benjamini and Hochberg's False Discovery Rate method. Topics include the choice and meaning of the null hypothesis in large-scale testing situations, power considerations, the limitations of permutation methods, significance testing for groups of cases (such as pathways in microarray studies), correlation effects, multiple confidence intervals and Bayesian competitors to the two-groups model.
Pages: 1-22
Page count: 22
Related papers
45 in total
[1]   A mixture model approach for the analysis of microarray gene expression data [J].
Allison, DB ;
Gadbury, GL ;
Heo, MS ;
Fernández, JR ;
Lee, CK ;
Prolla, TA ;
Weindruch, R .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 39 (01) :1-20
[2]  
[Anonymous], 2022, Testing statistical hypotheses, DOI 10.1007/978-3-030-70578-7
[3]   Determination of the differentially expressed genes in microarray experiments using local FDR [J].
Aubert, J ;
Bar-Hen, A ;
Daudin, JJ ;
Robin, S .
BMC BIOINFORMATICS, 2004, 5 (1)
[4]   False discovery rate-adjusted multiple confidence intervals for selected parameters [J].
Benjamini, Y ;
Yekutieli, D .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) :71-81
[5]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   A comparative review of estimates of the proportion unchanged genes and the false discovery rate [J].
Broberg, P .
BMC BIOINFORMATICS, 2005, 6 (1)
[8]   A Bayesian mixture model for differential gene expression [J].
Do, KA ;
Müller, P ;
Tang, F .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :627-644
[9]   Multiple hypothesis testing in microarray experiments [J].
Dudoit, S ;
Shaffer, JP ;
Boldrick, JC .
STATISTICAL SCIENCE, 2003, 18 (01) :71-103
[10]   Large-scale simultaneous hypothesis testing: The choice of a null hypothesis [J].
Efron, B .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (465) :96-104