A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments

被引:177
作者
Hong, Fangxin [1 ]
Breitling, Rainer [2 ]
机构
[1] City Hope Natl Med Ctr, Beckman Res Inst, Dept Biostat, Div Informat Sci, Duarte, CA 91010 USA
[2] Univ Groningen, Groningen Bioinformat Ctr, Groningen Biomol Sci & Biotechnol Inst, NL-9751 NN Haren, Netherlands
关键词
D O I
10.1093/bioinformatics/btm620
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The proliferation of public data repositories creates a need for meta-analysis methods to efficiently evaluate, integrate and validate related datasets produced by independent groups. A t-based approach has been proposed to integrate effect size from multiple studies by modeling both intra- and between-study variation. Recently, a non-parametric rank product method, which is derived based on biological reasoning of fold-change criteria, has been applied to directly combine multiple datasets into one meta study. Fishers Inverse chi(2) method, which only depends on P-values from individual analyses of each dataset, has been used in a couple of medical studies. While these methods address the question from different angles, it is not clear how they compare with each other. Results: We comparatively evaluate the three methods; t-based hierarchical modeling, rank products and Fishers Inverse chi(2) test with P-values from either the t-based or the rank product method. A simulation study shows that the rank product method, in general, has higher sensitivity and selectivity than the t-based method in both individual and meta-analysis, especially in the setting of small sample size and/or large between-study variation. Not surprisingly, Fishers chi(2) method highly depends on the method used in the individual analysis. Application to real datasets demonstrates that meta-analysis achieves more reliable identification than an individual analysis, and rank products are more robust in gene ranking, which leads to a much higher reproducibility among independent studies. Though t-based meta-analysis greatly improves over the individual analysis, it suffers from a potentially large amount of false positives when P-values serve as threshold. We conclude that careful meta-analysis is a powerful tool for integrating multiple array studies.
引用
收藏
页码:374 / 382
页数:9
相关论文
共 40 条
[1]  
[Anonymous], 1991, METAANALYSIS PROCEDU
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]   Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments [J].
Breitling, R ;
Armengaud, P ;
Amtmann, A ;
Herzyk, P .
FEBS LETTERS, 2004, 573 (1-3) :83-92
[4]  
Breitling Rainer, 2005, Journal of Bioinformatics and Computational Biology, V3, P1171, DOI 10.1142/S0219720005001442
[5]   Combining multiple microarray studies and modeling interstudy variation [J].
Choi, Jung Kyoon ;
Yu, Ungsik ;
Kim, Sangsoo ;
Yoo, Ook Joon .
BIOINFORMATICS, 2003, 19 :i84-i90
[6]   THE COMBINATION OF ESTIMATES FROM DIFFERENT EXPERIMENTS [J].
COCHRAN, WG .
BIOMETRICS, 1954, 10 (01) :101-129
[7]  
DeConde R, 2006, STAT APPL GENET MOL, V5
[8]   METAANALYSIS IN CLINICAL-TRIALS [J].
DERSIMONIAN, R ;
LAIRD, N .
CONTROLLED CLINICAL TRIALS, 1986, 7 (03) :177-188
[9]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[10]  
Fisher R. A., 1925, STAT METHODS RES WOR