A practical false discovery rate approach to identifying patterns of differential expression in microarray data

Cited by: 73
Authors
Grant, GR [1]
Liu, JM [1]
Stoeckert, CJ [1]
Affiliation
[1] Univ Penn, Ctr Bioinformat, Philadelphia, PA 19104 USA
DOI
10.1093/bioinformatics/bti407
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Searching for differentially expressed genes is one of the most common applications for microarrays, yet statistically there are difficult hurdles to achieving adequate rigor and practicality. False discovery rate (FDR) approaches have become relatively standard; however, how to define and control the FDR has been hotly debated. Permutation estimation approaches such as SAM and PaGE can be effective; however, they leave much room for improvement. We pursue the permutation estimation method and describe a convenient definition for the FDR that can be estimated in a straightforward manner. We then discuss issues regarding the choice of statistic and data transformation. It is impossible to optimize the power of any statistic for thousands of genes simultaneously, and we look at the practical consequences of this. For example, the log transform can both help and hurt at the same time, depending on the gene. We examine issues surrounding the SAM 'fudge factor' parameter, and how to handle these issues by optimizing with respect to power.
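As a rough illustration of the permutation estimation idea mentioned in the abstract, the sketch below estimates an FDR for a fixed threshold by comparing the number of genes called in the observed data with the average number called after shuffling the condition labels. It is not the FDR definition or the statistic proposed in the paper: the function names (`sam_like_stat`, `permutation_fdr`), the pooled-variance statistic, and the way the constant `s0` is added to the denominator are all assumptions chosen to resemble a generic SAM-style 'fudge factor' setup.

```python
import random
import statistics


def sam_like_stat(a, b, s0=0.0):
    """Difference of means divided by (pooled standard error + s0).

    s0 is a hypothetical stand-in for a SAM-style 'fudge factor'.
    """
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * statistics.variance(a)
                  + (nb - 1) * statistics.variance(b)) / (na + nb - 2)
    denom = (pooled_var * (1 / na + 1 / nb)) ** 0.5 + s0
    if denom == 0:
        return 0.0  # constant gene and s0 == 0: treat as no evidence
    return (statistics.mean(a) - statistics.mean(b)) / denom


def permutation_fdr(data, n_a, threshold, s0=0.0, n_perm=200, seed=0):
    """Estimate the FDR at a fixed threshold by permuting sample labels.

    data: list of per-gene lists; the first n_a values belong to
    condition A and the rest to condition B (an assumed layout).
    Returns (estimated FDR, number of genes called in the observed data).
    """
    rng = random.Random(seed)
    n_cols = len(data[0])
    observed = [abs(sam_like_stat(row[:n_a], row[n_a:], s0)) for row in data]
    n_called = sum(d >= threshold for d in observed)
    if n_called == 0:
        return 0.0, 0
    false_calls = 0
    for _ in range(n_perm):
        cols = list(range(n_cols))
        rng.shuffle(cols)  # same column shuffle applied to every gene
        for row in data:
            perm = [row[j] for j in cols]
            if abs(sam_like_stat(perm[:n_a], perm[n_a:], s0)) >= threshold:
                false_calls += 1
    expected_false = false_calls / n_perm  # mean calls per permutation
    return min(1.0, expected_false / n_called), n_called
```

Increasing `s0` shrinks the statistic most for genes with small variance, which is one reason its choice affects which genes are called. The same function could be applied to log-transformed values to compare the effect of the transform gene by gene.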
Pages: 2684-2690
Page count: 7