A simple procedure for estimating the false discovery rate

被引:80
作者
Dalmasso, C [1 ]
Broët, P [1 ]
Moreau, T [1 ]
机构
[1] Fac Med Paris Sud, INSERM, U472, F-94807 Villejuif, France
关键词
D O I
10.1093/bioinformatics/bti063
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The most used criterion in microarray data analysis is nowadays the false discovery rate (FDR). In the framework of estimating procedures based on the marginal distribution of the P-values without any assumption on gene expression changes, estimators of the FDR are necessarily conservatively biased. Indeed, only an upper bound estimate can be obtained for the key quantity pi(0), which is the probability for a gene to be unmodified. In this paper, we propose a novel family of estimators for pi(0) that allows the calculation of FDR. Results: The very simple method for estimating pi(0) called LBE (Location Based Estimator) is presented together with results on its variability. Simulation results indicate that the proposed estimator performs well in finite sample and has the best mean square error in most of the cases as compared with the procedures QVALUE, BUM and SPLOSH. The different procedures are then applied to real datasets.
引用
收藏
页码:660 / 668
页数:9
相关论文
共 16 条
[11]   Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values [J].
Pounds, S ;
Morris, SW .
BIOINFORMATICS, 2003, 19 (10) :1236-1242
[12]   Improving false discovery rate estimation [J].
Pounds, S ;
Cheng, C .
BIOINFORMATICS, 2004, 20 (11) :1737-1745
[13]  
SCHENA M, 2000, IN PRESS BIOTECHNIQU
[14]   The positive false discovery rate:: A Bayesian interpretation and the q-value [J].
Storey, JD .
ANNALS OF STATISTICS, 2003, 31 (06) :2013-2035
[15]   A direct approach to false discovery rates [J].
Storey, JD .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 :479-498
[16]   Statistical significance for genomewide studies [J].
Storey, JD ;
Tibshirani, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (16) :9440-9445