Effects of pooling mRNA in microarray class comparisons

被引:46
作者
Shih, JH [1 ]
Michalowska, AM
Dobbin, K
Ye, YM
Qiu, TH
Green, JE
机构
[1] NCI, Biometr Res Branch, Div Canc Treatment & Diag, Bethesda, MD 20892 USA
[2] NCI, Lab Cell Regulat & Carcinogenesis, Bethesda, MD 20892 USA
关键词
D O I
10.1093/bioinformatics/bth391
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In microarray experiments investigators sometimes wish to pool RNA samples before labeling and hybridization due to insufficient RNA from each individual sample or to reduce the number of arrays for the purpose of saving cost. The basic assumption of pooling is that the expression of an mRNA molecule in the pool is close to the average expression from individual samples. Recently, a method for studying the effect of pooling mRNA on statistical power in detecting differentially expressed genes between classes has been proposed, but the different sources of variation arising in microarray experiments were not distinguished. Another paper recently did take different sources of variation into account, but did not address power and sample size for class comparison. In this paper, we study the implication of pooling in detecting differential gene expression taking into account different sources of variation and check the basic assumption of pooling using data from both the cDNA and Affymetrix GeneChip microarray experiments. Results: We present formulas for the required number of subjects and arrays to achieve a desired power at a specified significance level. We show that due to the loss of degrees of freedom for a pooled design, a large increase in the number of subjects may be required to achieve a power comparable to that of a non-pooled design. The added expense of additional samples for the pooled design may outweigh the benefit of saving on microarray cost. The microarray data from both platforms show that the major assumption of pooling may not hold.
引用
收藏
页码:3318 / 3325
页数:8
相关论文
共 10 条
  • [1] Initiating oncogenic event determines gene-expression patterns of human breast cancer models
    Desai, KV
    Xiao, N
    Wang, W
    Gangi, L
    Greene, J
    Powell, JI
    Dickson, R
    Furth, P
    Hunter, K
    Kucherlapati, R
    Simon, R
    Liu, ET
    Green, JE
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6967 - 6972
  • [2] Questions and answers on design of dual-label microarrays for identifying differentially expressed genes
    Dobbin, K
    Shih, JH
    Simon, R
    [J]. JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (18) : 1362 - 1369
  • [3] Statistical design of reverse dye microarrays
    Dobbin, K
    Shih, JH
    Simon, R
    [J]. BIOINFORMATICS, 2003, 19 (07) : 803 - 810
  • [4] Comparison of microarray designs for class comparison and class discovery
    Dobbin, K
    Simon, R
    [J]. BIOINFORMATICS, 2002, 18 (11) : 1438 - 1445
  • [5] POWER AND SAMPLE-SIZE CALCULATIONS - A REVIEW AND COMPUTER-PROGRAM
    DUPONT, WD
    PLUMMER, WD
    [J]. CONTROLLED CLINICAL TRIALS, 1990, 11 (02): : 116 - 128
  • [6] Summaries of affymetrix GeneChip probe level data
    Irizarry, RA
    Bolstad, BM
    Collin, F
    Cope, LM
    Hobbs, B
    Speed, TP
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (04) : e15
  • [7] The efficiency of pooling mRNA in microarray experiments
    Kendziorski, CM
    Zhang, Y
    Lan, H
    Attie, AD
    [J]. BIOSTATISTICS, 2003, 4 (03) : 465 - 477
  • [8] Statistical implications of pooling RNA samples for microarray experiments
    Peng, XJ
    Wood, CL
    Blalock, EM
    Chen, KC
    Landfield, PW
    Stromberg, AJ
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [9] WU Z, 2004, IN PRESS J AM STAT A
  • [10] Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation
    Yang, YH
    Dudoit, S
    Luu, P
    Lin, DM
    Peng, V
    Ngai, J
    Speed, TP
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (04) : e15