Nonparametric methods for identifying differentially expressed genes in microarray data

被引:225
作者
Troyanskaya, OG
Garber, ME
Brown, PO
Botstein, D [1 ]
Altman, RB
机构
[1] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
[2] Stanford Univ, Sch Med, Dept Biochem, Stanford, CA 94305 USA
[3] Stanford Univ, Howard Hughes Med Inst, Stanford, CA 94305 USA
关键词
D O I
10.1093/bioinformatics/18.11.1454
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene expression experiments provide a fast and systematic way to identify disease markers relevant to clinical care. In this study, we address the problem of robust identification of differentially expressed genes from microarray data. Differentially expressed genes, or discriminator genes, are genes with significantly different expression in two user-defined groups of microarray experiments. We compare three model-free approaches: (1) nonparametric t-test, (2) Wilcoxon (or Mann-Whitney) rank sum test, and (3) a heuristic method based on high Pearson correlation to a perfectly differentiating gene ('ideal discriminator method'). We systematically assess the performance of each method based on simulated and biological data under varying noise levels and p-value cutoffs. Results: All methods exhibit very low false positive rates and identify a large fraction of the differentially expressed genes in simulated data sets with noise level similar to that of actual data. Overall, the rank sum test appears most conservative, which may be advantageous when the computationally identified genes need to be tested biologically. However, if a more inclusive list of markers is desired, a higher p-value cutoff or the nonparametric t-test may be appropriate. When applied to data from lung tumor and lymphoma data sets, the methods identify biologically relevant differentially expressed genes that allow clear separation of groups in question. Thus the methods described and evaluated here provide a convenient and robust way to identify differentially expressed genes for further biological and clinical analysis.
引用
收藏
页码:1454 / 1461
页数:8
相关论文
共 32 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [3] Tissue classification with gene expression profiles
    Ben-Dor, A
    Bruhn, L
    Friedman, N
    Nachman, I
    Schummer, M
    Yakhini, Z
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) : 559 - 583
  • [4] CYTOKERATINS IN NORMAL LUNG AND LUNG CARCINOMAS .1. ADENOCARCINOMAS, SQUAMOUS-CELL CARCINOMAS AND CULTURED-CELL LINES
    BLOBEL, GA
    MOLL, R
    FRANKE, WW
    VOGTMOYKOPF, I
    [J]. VIRCHOWS ARCHIV B-CELL PATHOLOGY INCLUDING MOLECULAR PATHOLOGY, 1984, 45 (04) : 407 - 429
  • [5] Exploring the metabolic and genetic control of gene expression on a genomic scale
    DeRisi, JL
    Iyer, VR
    Brown, PO
    [J]. SCIENCE, 1997, 278 (5338) : 680 - 686
  • [6] DUDOIT S, 2002, IN PRESS STAT SINICA
  • [7] DUDOIT S, 2002, IN PRESS J AM STAT A
  • [8] Ploidy regulation of gene expression
    Galitski, T
    Saldanha, AJ
    Styles, CA
    Lander, ES
    Fink, GR
    [J]. SCIENCE, 1999, 285 (5425) : 251 - 254
  • [9] Diversity of gene expression in adenocarcinoma of the lung
    Garber, ME
    Troyanskaya, OG
    Schluens, K
    Petersen, S
    Thaesler, Z
    Pacyna-Gengelbach, M
    van de Rijn, M
    Rosen, GD
    Perou, CM
    Whyte, RI
    Altman, RB
    Brown, PO
    Botstein, D
    Petersen, I
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) : 13784 - 13789
  • [10] Genomic expression programs in the response of yeast cells to environmental changes
    Gasch, AP
    Spellman, PT
    Kao, CM
    Carmel-Harel, O
    Eisen, MB
    Storz, G
    Botstein, D
    Brown, PO
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2000, 11 (12) : 4241 - 4257