How to decide? Different methods of calculating gene expression from short oligonucleotide array data will give different results

Cited by: 107
Authors
Millenaar, FF [1]
Okyere, J
May, ST
van Zanten, M
Voesenek, LACJ
Peeters, AJM
Affiliations
[1] Univ Utrecht, Fac Sci, Inst Environm Biol, NL-3584 CA Utrecht, Netherlands
[2] Univ Nottingham, Nottingham Arabidopsis Stock Ctr, Plant Sci Div, Loughborough LE12 5RD, Leics, England
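Funding
UK Biotechnology and Biological Sciences Research Council;
DOI
10.1186/1471-2105-7-137
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract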
Background: Short oligonucleotide arrays for transcript profiling have been available for several years. Raw data from these arrays are generally analysed with the Microarray Analysis Suite (MAS) or GeneChip Operating Software (GCOS) from Affymetrix. More recently, additional methods for analysing the raw data have become available. Ideally, all these methods should produce more or less the same results. We set out to evaluate the different methods, including work on our own data set, in order to test which method gives the most reliable results.
Results: Calculating gene expression with six different algorithms (MAS5, dChip PMMM, dChip PM, RMA, GC-RMA and PDNN) from the same (Arabidopsis) data yields different calculated gene expression levels. Consequently, depending on the method used, different genes are identified as differentially regulated. Surprisingly, there was only 27 to 36% overlap between the different methods. Furthermore, 47.5% of the genes/probe sets showed good correlation between the mismatch and perfect match intensities.
Conclusion: Of the six algorithms compared, RMA gave the most reproducible results and showed the highest correlation coefficients with Real Time RT-PCR data for genes identified as differentially expressed by all methods. However, for most genes identified solely by RMA, we were not able to verify the microarray results by Real Time RT-PCR. Furthermore, we conclude that subtracting the mismatch intensity from the perfect match intensity most likely results in a significant underestimation for at least 47.5% of the expression values. No algorithm produced significant expression values for genes present in quantities below 1 pmol. If the only purpose of the microarray experiment is to find new candidate genes, and too many genes are found, then mutual exclusion of the genes predicted by contrasting methods can be used to narrow down the list of new candidate genes by 64 to 73%.
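The 64 to 73% reduction quoted in the abstract appears to be the complement of the 27 to 36% overlap (100 - 36 = 64, 100 - 27 = 73). The Python sketch below illustrates that arithmetic on invented gene lists. It assumes "overlap" means the share of one method's gene list that is also reported by every other method; the abstract does not state the exact definition, and the function name, method names, and gene identifiers are hypothetical.

```python
from typing import Dict, Set


def overlap_summary(calls: Dict[str, Set[str]]) -> Dict[str, Dict[str, float]]:
    """For each method, report the share of its gene list common to all methods.

    Assumption: overlap is measured against each method's own list size.
    """
    if not calls:
        raise ValueError("need at least one method")
    # Genes reported as differentially expressed by every method.
    shared = set.intersection(*calls.values())
    summary = {}
    for method, genes in calls.items():
        overlap_pct = 100.0 * len(shared) / len(genes) if genes else 0.0
        summary[method] = {
            "overlap_pct": overlap_pct,
            # Keeping only the shared genes shrinks the list by the complement.
            "reduction_pct": 100.0 - overlap_pct,
        }
    return summary


# Hypothetical gene lists, invented for illustration only.
calls = {
    "method_A": {"g1", "g2", "g3", "g4"},
    "method_B": {"g1", "g2", "g5", "g6", "g7"},
    "method_C": {"g1", "g2", "g3", "g8"},
}
summary = overlap_summary(calls)
for method, stats in summary.items():
    print(method, stats)
```

With these toy lists, only two genes are shared by all three methods, so a list of five candidates from one method would shrink to two after mutual exclusion.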
Pages: 16