Methods for evaluating gene expression from Affymetrix microarray datasets

被引:23
作者
Jiang, Ning [1 ]
Leach, Lindsey J. [1 ]
Hu, Xiaohua [3 ]
Potokina, Elena
Jia, Tianye [1 ]
Druka, Arnis [1 ,2 ]
Waugh, Robbie
Kearsey, Michael J. [1 ]
Luo, Zewei W. [1 ,3 ]
机构
[1] Univ Birmingham, Sch Biosci, Edgbaston Birmingham B15 2TT, W Midlands, England
[2] Scottish Crop Res Inst, Dundee DD2 5DA, Scotland
[3] Fudan Univ, Inst Biostat, Shanghai 200433, Peoples R China
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1186/1471-2105-9-284
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Affymetrix high density oligonucleotide expression arrays are widely used across all fields of biological research for measuring genome- wide gene expression. An important step in processing oligonucleotide microarray data is to produce a single value for the gene expression level of an RNA transcript using one of a growing number of statistical methods. The challenge for the researcher is to decide on the most appropriate method to use to address a specific biological question with a given dataset. Although several research efforts have focused on assessing performance of a few methods in evaluating gene expression from RNA hybridization experiments with different datasets, the relative merits of the methods currently available in the literature for evaluating genome- wide gene expression from Affymetrix microarray data collected from real biological experiments remain actively debated. Results: The present study reports a comprehensive survey of the performance of all seven commonly used methods in evaluating genome- wide gene expression from a well- designed experiment using Affymetrix microarrays. The experiment profiled eight genetically divergent barley cultivars each with three biological replicates. The dataset so obtained confers a balanced and idealized structure for the present analysis. The methods were evaluated on their sensitivity for detecting differentially expressed genes, reproducibility of expression values across replicates, and consistency in calling differentially expressed genes. The number of genes detected as differentially expressed among methods differed by a factor of two or more at a given false discovery rate (FDR) level. Moreover, we propose the use of genes containing single feature polymorphisms (SFPs) as an empirical test for comparison among methods for the ability to detect true differential gene expression on the basis that SFPs largely correspond to cis- acting expression regulators. The PDNN method demonstrated superiority over all other methods in every comparison, whilst the default Affymetrix MAS5.0 method was clearly inferior. Conclusion: A comprehensive assessment of seven commonly used data extraction methods based on an extensive barley Affymetrix gene expression dataset has shown that the PDNN method has superior performance for the detection of differentially expressed genes.
引用
收藏
页数:10
相关论文
共 27 条
[1]  
*AFF, AFF STAT ALG DESCR D
[2]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[3]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[4]   Large-scale identification of single-feature polymorphisms in complex genomes [J].
Borevitz, JO ;
Liang, D ;
Plouffe, D ;
Chang, HS ;
Zhu, T ;
Weigel, D ;
Berry, CC ;
Winzeler, E ;
Chory, J .
GENOME RESEARCH, 2003, 13 (03) :513-523
[5]   A benchmark for affymetrix GeneChip expression measures [J].
Cope, LM ;
Irizarry, RA ;
Jaffee, HA ;
Wu, ZJ ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (03) :323-331
[6]   Sex genes for genomic analysis in human brain: internal controls for comparison of probe level data extraction. [J].
Galfalvy, HC ;
Erraji-Benchekroun, L ;
Smyrniotopoulos, P ;
Pavlidis, P ;
Ellis, SP ;
Mann, JJ ;
Sibille, E ;
Arango, V .
BMC BIOINFORMATICS, 2003, 4 (1) :1-15
[7]   Comparison of algorithms for the analysis of Affymetrix microarray data as evaluated by co-expression of genes in known operons [J].
Harr, B ;
Schlötterer, C .
NUCLEIC ACIDS RESEARCH, 2006, 34 (02) :1-8
[8]  
Hoffmann R, 2002, GENOME BIOL, V3
[9]   Genetic dissection of ethanol tolerance in the budding yeast Saccharomyces cerevisiae [J].
Hu, X. H. ;
Wang, M. H. ;
Tan, T. ;
Li, J. R. ;
Yang, H. ;
Leach, L. ;
Zhang, R. M. ;
Luo, Z. W. .
GENETICS, 2007, 175 (03) :1479-1487
[10]   Comparison of Affymetrix GeneChip expression measures [J].
Irizarry, RA ;
Wu, ZJ ;
Jaffee, HA .
BIOINFORMATICS, 2006, 22 (07) :789-794