Biological validation of differentially expressed genes in chronic lymphocytic leukemia identified by applying multiple statistical methods to oligonucleotide microarrays

被引:9
作者
Abruzzo, LV
Wang, J
Kapoor, M
Medeiros, LJ
Keating, MJ
Highsmith, WE
Barron, LL
Cromwell, CC
Coombes, KR
机构
[1] Univ Texas, MD Anderson Canc Ctr, Dept Hematopathol, Houston, TX 77030 USA
[2] Univ Texas, MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[3] Univ Texas, MD Anderson Canc Ctr, Dept Canc Genet, Houston, TX 77030 USA
[4] Univ Texas, MD Anderson Canc Ctr, Dept Leukemia, Houston, TX 77030 USA
[5] Mayo Clin, Dept Lab Med & Pathol, Rochester, MN USA
关键词
D O I
10.1016/S1525-1578(10)60562-4
中图分类号
R36 [病理学];
学科分类号
100104 ;
摘要
Oligonucleotide microarrays are a powerful tool for profiling the expression levels of thousands of genes. Different statistical methods for identifying differentially expressed genes can yield different results. To our knowledge, no experimental test has been performed to decide which method best identifies genes that are truly differentially expressed. We applied three statistical methods (dChip, t-test on log-transformed data, and Wilcoxon test) to identify differentially expressed genes in previously untreated patients with chronic lymphocytic leukemia (CLL). We used a training set of Affymetrix Hu133A microarray data from 11 patients with unmutated immunoglobulin (Ig) heavy chain variable region (V-H) genes and 8 patients with mutated Ig V-H genes. Differential expression was validated using semiquantitative real-time polymerase chain reaction assays and by validating models to predict the somatic mutation status of an independent test set of nine CLL samples. The methods identified 144 genes that were differentially expressed between cases of CLL with unmutated compared with mutated Ig V-H genes. Eighty genes were identified by Wilcoxon test, 60 by t-test, and 65 by dChip, but only 11 were identified by all three methods. Greater agreement was found between the t-test and the Wilcoxon test. Differential expression was validated by semiquantitative real-time polymerase chain reaction assays for 83% of individual genes, regardless of the statistical method. However, the Wilcoxon test gave the most accurate predictions on new samples, and Whip, the least accurate. We found that all three methods were equally good for finding differentially expressed genes, but they found different genes. The genes selected by the nonparametric Wilcoxon test are the most robust for predicting the status of new cases. A comprehensive list of all differentially expressed genes can only be obtained by combining the results of multiple statistical tests.
引用
收藏
页码:337 / 345
页数:9
相关论文
共 27 条
[1]   Identifying differentially expressed genes in cDNA microarray experiments [J].
Baggerly, KA ;
Coombes, KR ;
Hess, KR ;
Stivers, DN ;
Abruzzo, LV ;
Zhang, W .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) :639-659
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]   Synthesis of adenine and guanine nucleotides at the 'inosinic branch point' in lymphocytes of leukemia patients [J].
Carlucci, F ;
Tabucchi, A ;
Pagani, R ;
Marinello, E .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE, 1999, 1454 (01) :106-114
[4]  
CARPENTIERI U, 1980, J CYCLIC NUCL PROT, V6, P253
[5]   Expression of ZAP-70 is associated with increased B-cell receptor signaling in chronic lymphocytic leukemia [J].
Chen, LG ;
Widhopf, G ;
Huynh, L ;
Rassenti, L ;
Rai, KR ;
Weiss, A ;
Kipps, TJ .
BLOOD, 2002, 100 (13) :4609-4614
[6]   RCH1, A PROTEIN THAT SPECIFICALLY INTERACTS WITH THE RAG-1 RECOMBINATION-ACTIVATING PROTEIN [J].
CUOMO, CA ;
KIRCH, SA ;
GYURIS, J ;
BRENT, R ;
OETTINGER, MA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (13) :6156-6160
[7]   Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia [J].
Damle, RN ;
Wasil, T ;
Fais, F ;
Ghiotto, F ;
Valetto, A ;
Allen, SL ;
Buchbinder, A ;
Budman, D ;
Dittmar, K ;
Kolitz, J ;
Lichtman, SM ;
Schulman, P ;
Vinciguerra, VP ;
Rai, KR ;
Ferrarini, M ;
Chiorazzi, N .
BLOOD, 1999, 94 (06) :1840-1847
[8]  
Durbin B P, 2002, Bioinformatics, V18 Suppl 1, pS105
[9]   Empirical Bayes methods and false discovery rates for microarrays [J].
Efron, B ;
Tibshirani, R .
GENETIC EPIDEMIOLOGY, 2002, 23 (01) :70-86
[10]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868