Analysis of gene expression in pathophysiological states: Balancing false discovery and false negative rates

被引:42
作者
Norris, AW
Kahn, CR
机构
[1] Joslin Diabet Ctr, Boston, MA 02215 USA
[2] Childrens Hosp, Boston, MA 02115 USA
[3] Harvard Univ, Sch Med, Boston, MA 02115 USA
关键词
metabolic disease; microarray analysis; multiple hypothesis testing; statistics;
D O I
10.1073/PNAS.0510115103
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nucleotide-microarray technology, which allows the simultaneous measurement of the expression of tens of thousands of genes, has become an important tool in the study of disease. In disorders such as malignancy, gene expression often undergoes broad changes of sizable magnitude, whereas in many common multifactorial diseases, such as diabetes, obesity, and atherosclerosis, the changes in gene expression are modest. In the latter circumstance, it is therefore challenging to distinguish the truly changing from nonchanging genes, especially because statistical significance must be considered in the context of multiple hypothesis testing. Here, we present a balanced probability analysis (BPA), which provides the biologist with an approach to interpret results in the context of the total number of genes truly differentially expressed and false discovery and false negative rates for the list of genes reaching any significance threshold. In situations where the changes are of modest magnitude, sole consideration of the false discovery rate can result in poor power to detect genes truly differentially expressed. Concomitant analysis of the rate of truly differentially expressed genes not identified, i.e., the false negative rate, allows balancing of the two error rates and a more thorough insight into the data. To this end, we have developed a unique, model-based procedure for the estimation of false negative rates, which allows application of BPA to real data in which changes are modest.
引用
收藏
页码:649 / 653
页数:5
相关论文
共 22 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]  
Butte A J, 2001, Pac Symp Biocomput, P6
[3]   Multiple-testing strategy for analyzing cDNA array data on gene expression [J].
Delongchamp, RR ;
Bowyer, JF ;
Chen, JJ ;
Kodell, RL .
BIOMETRICS, 2004, 60 (03) :774-782
[4]   Evolutionary algorithms for finding optimal gene sets in microarray prediction [J].
Deutsch, JM .
BIOINFORMATICS, 2003, 19 (01) :45-52
[5]   Resampling-based multiple testing for microarray data analysis [J].
Ge, YC ;
Dudoit, S ;
Speed, TP .
TEST, 2003, 12 (01) :1-77
[6]   Operating characteristics and extensions of the false discovery rate procedure [J].
Genovese, C ;
Wasserman, L .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 :499-517
[7]  
GREENBERG RA, 1969, AM REV RESPIR DIS, V100, P645
[8]  
Hsueh Huey-miin, 2003, J Biopharm Stat, V13, P675, DOI 10.1081/BIP-120024202
[9]   Analyzing protein function on a genomic scale: the importance of gold-standard positives and negatives for network prediction [J].
Jansen, R ;
Gerstein, M .
CURRENT OPINION IN MICROBIOLOGY, 2004, 7 (05) :535-545
[10]   SIGNAL DETECTABILITY AND MEDICAL DECISION-MAKING [J].
LUSTED, LB .
SCIENCE, 1971, 171 (3977) :1217-&