Determination of the differentially expressed genes in microarray experiments using local FDR

被引:80
作者
Aubert, J [1 ]
Bar-Hen, A [1 ]
Daudin, JJ [1 ]
Robin, S [1 ]
机构
[1] INAPG INRA ENGREF, UMR 518, F-75231 Paris 05, France
关键词
D O I
10.1186/1471-2105-5-125
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Thousands of genes in a genomewide data set are tested against some null hypothesis, for detecting differentially expressed genes in microarray experiments. The expected proportion of false positive genes in a set of genes, called the False Discovery Rate (FDR), has been proposed to measure the statistical significance of this set. Various procedures exist for controlling the FDR. However the threshold (generally 5%) is arbitrary and a specific measure associated with each gene would be worthwhile. Results: Using process intensity estimation methods, we define and give estimates of the local FDR, which may be considered as the probability for a gene to be a false positive. After a global assessment rule controlling the false positive error, the local FDR is a valuable guideline for deciding wether a gene is differentially expressed. The interest of the method is illustrated on three well known data sets. A R routine for computing local FDR estimates from p-values is available at http://www.inapg.fr/ens rech/mathinfo/recherche/mathematique/outil.html. Conclusions: The local FDR associated with each gene measures the probability that it is a false positive. It gives the opportunity to compute the FDR of any given group of clones (of the same gene) or genes pertaining to the same regulation network or the same chromosomic region.
引用
收藏
页数:9
相关论文
共 9 条
  • [1] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [2] Cressie N, 1993, STAT SPATIAL DATA
  • [3] Multiple hypothesis testing in microarray experiments
    Dudoit, S
    Shaffer, JP
    Boldrick, JC
    [J]. STATISTICAL SCIENCE, 2003, 18 (01) : 71 - 103
  • [4] Dudoit S, 2002, STAT SINICA, V12, P111
  • [5] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [6] Gene-expression profiles in hereditary breast cancer.
    Hedenfalk, I
    Duggan, D
    Chen, YD
    Radmacher, M
    Bittner, M
    Simon, R
    Meltzer, P
    Gusterson, B
    Esteller, M
    Kallioniemi, OP
    Wilfond, B
    Borg, Å
    Trent, J
    Raffeld, M
    Yakhini, Z
    Ben-Dor, A
    Dougherty, E
    Kononen, J
    Bubendorf, L
    Fehrle, W
    Pittaluga, S
    Gruvberger, S
    Loman, N
    Johannsoson, O
    Olsson, H
    Sauter, G
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2001, 344 (08) : 539 - 548
  • [7] Johnson NL, 1995, CONTINUOUS UNIVARIAT
  • [8] Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach
    Storey, JD
    Taylor, JE
    Siegmund, D
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2004, 66 : 187 - 205
  • [9] Statistical significance for genomewide studies
    Storey, JD
    Tibshirani, R
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (16) : 9440 - 9445