Power enhancement via multivariate outlier testing with gene expression arrays

被引:17
作者
Asare, Adam L. [1 ]
Gao, Zhong [1 ]
Carey, Vincent J. [2 ]
Wang, Richard [1 ]
Seyfert-Margolis, Vicki [1 ]
机构
[1] Univ Calif San Francisco, Immune Tolerance Network, San Francisco, CA 94143 USA
[2] Harvard Univ, Sch Med, Brigham & Womens Hosp, Channing Lab, Boston, MA 02115 USA
关键词
PROBE LEVEL; MICROARRAY;
D O I
10.1093/bioinformatics/btn591
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: As the use of microarrays in human studies continues to increase, stringent quality assurance is necessary to ensure accurate experimental interpretation. We present a formal approach for microarray quality assessment that is based on dimension reduction of established measures of signal and noise components of expression followed by parametric multivariate outlier testing. Results: We applied our approach to several data resources. First, as a negative control, we found that the Affymetrix and Illumina contributions to MAQC data were free from outliers at a nominal outlier flagging rate of alpha = 0.01. Second, we created a tunable framework for artificially corrupting intensity data from the Affymetrix Latin Square spike-in experiment to allow investigation of sensitivity and specificity of quality assurance (QA) criteria. Third, we applied the procedure to 507 Affymetrix microarray GeneChips processed with RNA from human peripheral blood samples. We show that exclusion of arrays by this approach substantially increases inferential power, or the ability to detect differential expression, in large clinical studies.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 12 条
  • [1] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [2] CARONI C, 1992, APPL STAT-J ROY ST C, V41, P355
  • [3] lumi:: a pipeline for processing Illumina microarray
    Du, Pan
    Kibbe, Warren A.
    Lin, Simon M.
    [J]. BIOINFORMATICS, 2008, 24 (13) : 1547 - 1548
  • [4] MDQC: a new quality assessment method for microarrays based on quality control reports
    Freue, Gabriela V. Cohen
    Hollander, Zsuzsanna
    Shen, Enqing
    Zamar, Ruben H.
    Balshaw, Robert
    Scherer, Andreas
    McManus, Bruce
    Keown, Paul
    McMaster, W. Robert
    Ng, Raymond T.
    [J]. BIOINFORMATICS, 2007, 23 (23) : 3162 - 3169
  • [5] Power and sample size estimation in high dimensional biology
    Gadbury, GL
    Page, GP
    Edwards, J
    Kayo, T
    Prolla, TA
    Weindruch, R
    Permana, PA
    Mountz, JD
    Allison, DB
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2004, 13 (04) : 325 - 338
  • [6] affy -: analysis of Affymetrix GeneChip data at the probe level
    Gautier, L
    Cope, L
    Bolstad, BM
    Irizarry, RA
    [J]. BIOINFORMATICS, 2004, 20 (03) : 307 - 315
  • [7] GROUP TABP, 2004, NAT REV GENET, V5, P229
  • [8] Robust estimators for expression analysis
    Hubbell, E
    Liu, WM
    Mei, R
    [J]. BIOINFORMATICS, 2002, 18 (12) : 1585 - 1592
  • [9] Summaries of affymetrix GeneChip probe level data
    Irizarry, RA
    Bolstad, BM
    Collin, F
    Cope, LM
    Hobbs, B
    Speed, TP
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (04) : e15
  • [10] The PowerAtlas:: a power and sample size atlas for microarray experimental design and research
    Page, GP
    Edwards, JW
    Gadbury, GL
    Yelisetti, P
    Wang, JL
    Trivedi, P
    Allison, DB
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)