The PowerAtlas:: a power and sample size atlas for microarray experimental design and research

被引:33
作者
Page, GP [1 ]
Edwards, JW
Gadbury, GL
Yelisetti, P
Wang, JL
Trivedi, P
Allison, DB
机构
[1] Univ Alabama Birmingham, Dept Biostat, Sect Stat Genet, Birmingham, AL 35294 USA
[2] Iowa State Univ, Dept Agron, USDA ARS, Ames, IA USA
[3] Univ Missouri, Dept Math & Stat, Rolla, MO 65401 USA
关键词
D O I
10.1186/1471-2105-7-84
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Microarrays permit biologists to simultaneously measure the mRNA abundance of thousands of genes. An important issue facing investigators planning microarray experiments is how to estimate the sample size required for good statistical power. What is the projected sample size or number of replicate chips needed to address the multiple hypotheses with acceptable accuracy? Statistical methods exist for calculating power based upon a single hypothesis, using estimates of the variability in data from pilot studies. There is, however, a need for methods to estimate power and/or required sample sizes in situations where multiple hypotheses are being tested, such as in microarray experiments. In addition, investigators frequently do not have pilot data to estimate the sample sizes required for microarray studies. Results: To address this challenge, we have developed a Microrarray PowerAtlas [1]. The atlas enables estimation of statistical power by allowing investigators to appropriately plan studies by building upon previous studies that have similar experimental characteristics. Currently, there are sample sizes and power estimates based on 632 experiments from Gene Expression Omnibus (GEO). The PowerAtlas also permits investigators to upload their own pilot data and derive power and sample size estimates from these data. This resource will be updated regularly with new datasets from GEO and other databases such as The Nottingham Arabidopsis Stock Center (NASC). Conclusion: This resource provides a valuable tool for investigators who are planning efficient microarray studies and estimating required sample sizes.
引用
收藏
页数:9
相关论文
共 16 条
[1]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[2]   A mixture model approach for the analysis of microarray gene expression data [J].
Allison, DB ;
Gadbury, GL ;
Heo, MS ;
Fernández, JR ;
Lee, CK ;
Prolla, TA ;
Weindruch, R .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 39 (01) :1-20
[3]  
[Anonymous], MATH CHALLENGES 21 C
[4]  
Ball CA, 2002, SCIENCE, V298, P539
[5]   Gene Expression Omnibus: NCBI gene expression and hybridization array data repository [J].
Edgar, R ;
Domrachev, M ;
Lash, AE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :207-210
[6]   Power and sample size estimation in high dimensional biology [J].
Gadbury, GL ;
Page, GP ;
Edwards, J ;
Kayo, T ;
Prolla, TA ;
Weindruch, R ;
Permana, PA ;
Mountz, JD ;
Allison, DB .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2004, 13 (04) :325-338
[7]  
Kerr MK, 2001, GENET RES, V77, P123
[8]   Analysis of variance for gene expression microarray data [J].
Kerr, MK ;
Martin, M ;
Churchill, GA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (06) :819-837
[9]   Power and sample size for DNA microarray studies [J].
Lee, MLT ;
Whitmore, GA .
STATISTICS IN MEDICINE, 2002, 21 (23) :3543-3570
[10]   Improved statistical inference from DNA microarray data using analysis of variance and a Bayesian statistical framework -: Analysis of global gene expression in Escherichia coli K12 [J].
Long, AD ;
Mangalam, HJ ;
Chan, BYP ;
Tolleri, L ;
Hatfield, GW ;
Baldi, P .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (23) :19937-19944