Characterizing dye bias in microarray experiments

被引:46
作者
Dobbin, KK [1 ]
Kawasaki, ES
Petersen, DW
Simon, RM
机构
[1] NCI, Biometr Res Branch, NIH, Bethesda, MD 20892 USA
[2] NCI, Ctr Adv Technol, NIH, Bethesda, MD 20892 USA
关键词
D O I
10.1093/bioinformatics/bti378
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Spot intensity serves as a proxy for gene expression in dual-label microarray experiments. Dye bias is defined as an intensity difference between samples labeled with different dyes attributable to the dyes instead of the gene expression in the samples. Dye bias that is not removed by array normalization can introduce bias into comparisons between samples of interest. But if the bias is consistent across samples for the same gene, it can be corrected by proper experimental design and analysis. If the dye bias is not consistent across samples for the same gene, but is different for different samples, then removing the bias becomes more problematic, perhaps indicating a technical limitation to the ability of fluorescent signals to accurately represent gene expression. Thus, it is important to characterize dye bias to determine: (1) whether it will be removed for all genes by array normalization, (2) whether it will not be removed by normalization but can be removed by proper experimental design and analysis and (3) whether dye bias correction is more problematic than either of these and is not easily removable. Results: We analyzed two large (each > 27 arrays) tissue culture experiments with extensive dye swap arrays to better characterize dye bias. Indirect, amino-allyl labeling was used in both experiments. We found that post-normalization dye bias that is consistent across samples does appear to exist for many genes, and that controlling and correcting for this type of dye bias in design and analysis is advisable. The extent of this type of dye bias remained unchanged under a wide range of normalization methods (median-centering, various loess normalizations) and statistical analysis techniques (parametric, rank based, permutation based, etc.). We also found dye bias related to the individual samples for a much smaller subset of genes. But these sample-specific dye biases appeared to have minimal impact on estimated gene-expression differences between the cell lines.
引用
收藏
页码:2430 / 2437
页数:8
相关论文
共 22 条
[1]   ROBUST ESTIMATION IN HETEROSCEDASTIC LINEAR-MODELS [J].
CARROLL, RJ ;
RUPPERT, D .
ANNALS OF STATISTICS, 1982, 10 (02) :429-441
[2]   A COMPARISON BETWEEN MAXIMUM-LIKELIHOOD AND GENERALIZED LEAST-SQUARES IN A HETEROSCEDASTIC LINEAR-MODEL [J].
CARROLL, RJ ;
RUPPERT, D .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1982, 77 (380) :878-882
[3]  
Chen SL, 1996, ONCOGENE, V12, P741
[4]  
Cleveland WS, 1992, Statistical Models in S, P309, DOI [DOI 10.1201/9780203738535-8, 10.1201/9780203738535]
[5]  
Cochran W.G., 1992, Experimental designs
[6]   Sample size determination in microarray experiments for class comparison and prognostic classification [J].
Dobbin, K ;
Simon, R .
BIOSTATISTICS, 2005, 6 (01) :27-38
[7]   Questions and answers on design of dual-label microarrays for identifying differentially expressed genes [J].
Dobbin, K ;
Shih, JH ;
Simon, R .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (18) :1362-1369
[8]   Statistical design of reverse dye microarrays [J].
Dobbin, K ;
Shih, JH ;
Simon, R .
BIOINFORMATICS, 2003, 19 (07) :803-810
[9]   Comparison of microarray designs for class comparison and class discovery [J].
Dobbin, K ;
Simon, R .
BIOINFORMATICS, 2002, 18 (11) :1438-1445
[10]   Gene-specific dye bias in microarray reference designs [J].
Dombkowski, AA ;
Thibodeau, BJ ;
Starcevic, SL ;
Novak, RF .
FEBS LETTERS, 2004, 560 (1-3) :120-124