Empirical evaluation of data transformations and ranking statistics for microarray analysis

被引:61
作者
Qin, LX [1 ]
Kerr, KF [1 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
关键词
D O I
10.1093/nar/gkh866
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
There are many options in handling microarray data that can affect study conclusions, sometimes drastically. Working with a two-color platform, this study uses ten spike-in microarray experiments to evaluate the relative effectiveness of some of these options for the experimental goal of detecting differential expression. We consider two data transformations, background subtraction and intensity normalization, as well as six different statistics for detecting differentially expressed genes. Findings support the use of an intensity-based normalization procedure and also indicate that local background subtraction can be detrimental for effectively detecting differential expression. We also verify that robust statistics outperform t-statistics in identifying differentially expressed genes when there are few replicates. Finally, we find that choice of image analysis software can also substantially influence experimental conclusions.
引用
收藏
页码:5471 / 5479
页数:9
相关论文
共 11 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[3]   A benchmark for affymetrix GeneChip expression measures [J].
Cope, LM ;
Irizarry, RA ;
Jaffee, HA ;
Wu, ZJ ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (03) :323-331
[4]  
CUI X, 2003, STAT METH GENET MOL, V2
[5]  
Kerr MK, 2002, STAT SINICA, V12, P203
[6]   Improved background correction for spotted DNA microarrays [J].
Kooperberg, C ;
Fazzio, TG ;
Delrow, JJ ;
Tsukiyama, T .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (01) :55-66
[7]  
Lönnstedt I, 2002, STAT SINICA, V12, P31
[8]   Towards sound epistemological foundations of statistical methods for high-dimensional biology [J].
Mehta, T ;
Tanik, M ;
Allison, DB .
NATURE GENETICS, 2004, 36 (09) :943-947
[9]   Significance analysis of microarrays applied to the ionizing radiation response [J].
Tusher, VG ;
Tibshirani, R ;
Chu, G .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (09) :5116-5121
[10]   Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation [J].
Yang, YH ;
Dudoit, S ;
Luu, P ;
Lin, DM ;
Peng, V ;
Ngai, J ;
Speed, TP .
NUCLEIC ACIDS RESEARCH, 2002, 30 (04) :e15