An interactive power analysis tool for microarray hypothesis testing and generation

Cited by: 105
Authors
Seo, J [1]
Gordish-Dressman, H [1]
Hoffman, EP [1]
Affiliations
[1] Childrens Natl Med Ctr, Med Genet Res Ctr, Washington, DC 20010 USA
DOI
10.1093/bioinformatics/btk052
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Motivation: Human clinical projects typically require a priori statistical power analyses. To this end, we sought to build a flexible and interactive power analysis tool for microarray studies, integrated into our public domain HCE 3.5 software package. We then sought to determine whether probe set algorithm or organism type strongly influenced power analysis results.
Results: The HCE 3.5 power analysis tool was designed to import any pre-existing Affymetrix microarray project and interactively test the effects of user-defined values of alpha (significance), beta (1 - power), sample size and effect size. The tool generates a filter for all probe sets or for more focused ontology-based subsets, with or without noise filters, that can be used to limit analyses of a future project to appropriately powered probe sets. We studied projects from three organisms (Arabidopsis, rat, human) and three probe set algorithms (MAS5.0, RMA, dChip PM/MM). We found large differences in power results based on probe set algorithm selection and noise filters. RMA provided high sensitivity for low numbers of arrays, but this came at the cost of a high false positive rate (24% false positives in the human project studied). Our data suggest that a priori power calculations are important for experimental design in both hypothesis testing and hypothesis generation, as well as for the selection of optimized data analysis parameters.
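The relationship among alpha, power, sample size and effect size described in the abstract can be illustrated with a minimal sketch. This is not the HCE 3.5 implementation: it assumes a two-sided, two-group comparison per probe set and uses a normal approximation in place of the t-distribution. The function names, the probe set identifiers and the pilot standard deviations are all hypothetical.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def two_sample_power(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-group comparison.

    effect_size is the standardized mean difference (difference / sd).
    A normal approximation is used, so results are optimistic for very
    small groups compared with an exact t-based calculation.
    """
    z_crit = _STD_NORMAL.inv_cdf(1 - alpha / 2)
    shift = abs(effect_size) * sqrt(n_per_group / 2)
    # Probability of landing in either rejection region under the alternative.
    return _STD_NORMAL.cdf(shift - z_crit) + _STD_NORMAL.cdf(-shift - z_crit)


def powered_probe_sets(pilot_sd, n_per_group, log2_fold_change,
                       alpha=0.05, min_power=0.8):
    """Return ids of probe sets that reach min_power for the planned design.

    pilot_sd maps a probe set id to the standard deviation of its log2
    signal in a pilot project (hypothetical input format).
    """
    keep = []
    for probe_id, sd in pilot_sd.items():
        if sd <= 0:
            continue  # cannot standardize the effect without variability
        effect_size = log2_fold_change / sd
        if two_sample_power(effect_size, n_per_group, alpha) >= min_power:
            keep.append(probe_id)
    return keep


if __name__ == "__main__":
    # Hypothetical pilot variability for two probe sets.
    pilot = {"probe_a": 0.5, "probe_b": 2.0}
    # Which probe sets can detect a two-fold change with 10 arrays per group?
    print(powered_probe_sets(pilot, n_per_group=10, log2_fold_change=1.0))
```

Raising `n_per_group` or relaxing `alpha` increases the number of probe sets that pass the filter, which is the trade-off the interactive tool is described as exploring.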
Pages: 808-814
Page count: 7