Finding differentially expressed genes in two-channel DNA microarray datasets: How to increase reliability of data preprocessing

被引:7
作者
Rotter, Ana [1 ]
Hren, Matjaz [1 ]
Baebler, Spela [1 ]
Blejec, Andrej [2 ]
Gruden, Kristina [1 ]
机构
[1] Natl Inst Biol, Dept Biotechnol & Syst Biol, Ljubljana 1000, Slovenia
[2] Natl Inst Biol, Dept Entomol, Ljubljana 1000, Slovenia
关键词
D O I
10.1089/omi.2008.0032
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Due to the great variety of preprocessing tools in two-channel expression microarray data analysis it is difficult to choose the most appropriate one for a given experimental setup. In our study, two independent two-channel inhouse microarray experiments as well as a publicly available dataset were used to investigate the influence of the selection of preprocessing methods (background correction, normalization, and duplicate spots correlation calculation) on the discovery of differentially expressed genes. Here we are showing that both the list of differentially expressed genes and the expression values of selected genes depend significantly on the preprocessing approach applied. The choice of normalization method to be used had the highest impact on the results. We propose a simple but efficient approach to increase the reliability of obtained results, where two normalization methods which are theoretically distinct from one another are used on the same dataset. Then the intersection of results, that is, the lists of differentially expressed genes, is used in order to get a more accurate estimation of the genes that were de facto differentially expressed.
引用
收藏
页码:171 / 182
页数:12
相关论文
共 20 条
[1]   A Mouse Stromal Response to Tumor Invasion Predicts Prostate and Breast Cancer Patient Survival [J].
Bacac, Marina ;
Provero, Paolo ;
Mayran, Nathalie ;
Stehle, Jean-Christophe ;
Fusco, Carlo ;
Stamenkovic, Ivan .
PLOS ONE, 2006, 1 (01)
[2]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[3]  
Bolstad BM, 2005, STAT BIOL HEALTH, P13
[4]   Bioinformatic methods for integrating whole-genome expression results into cellular networks [J].
Cavalieri, D ;
De Filippo, C .
DRUG DISCOVERY TODAY, 2005, 10 (10) :727-734
[5]   Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset [J].
Choe, SE ;
Boutros, M ;
Michelson, AM ;
Church, GM ;
Halfon, MS .
GENOME BIOLOGY, 2005, 6 (02)
[6]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[7]   A benchmark for affymetrix GeneChip expression measures [J].
Cope, LM ;
Irizarry, RA ;
Jaffee, HA ;
Wu, ZJ ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (03) :323-331
[8]   Statistical challenges in the analysis of two-dimensional difference gel electrophoresis experiments using DeCyder™ [J].
Fodor, IK ;
Nelson, DO ;
Alegria-Hartman, M ;
Robbins, K ;
Langlois, RG ;
Turteltaub, KW ;
Corzett, TH ;
McCutchen-Maloney, SL .
BIOINFORMATICS, 2005, 21 (19) :3733-3740
[9]  
Huber Wolfgang, 2002, Bioinformatics, V18 Suppl 1, pS96
[10]   What is the best reference RNA? And other questions regarding the design and analysis of two-color microarray experiments [J].
Kerr, Kathleen F. ;
Serikawa, Kyle A. ;
Wei, Caimiao ;
Peters, Mette A. ;
Bumgarner, Roger E. .
OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2007, 11 (02) :152-165