CARMA: A platform for analyzing microarray datasets that incorporate replicate measures

被引:18
作者
Greer, KA
McReynolds, MR
Brooks, HL
Hoying, JB [1 ]
机构
[1] Univ Arizona, Genom Res Lab, Biomed Engn Program, Tucson, AZ 85724 USA
[2] Univ Arizona, Coll Med, Dept Physiol, Tucson, AZ 85724 USA
关键词
D O I
10.1186/1471-2105-7-149
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The incorporation of statistical models that account for experimental variability provides a necessary framework for the interpretation of microarray data. A robust experimental design coupled with an analysis of variance ( ANOVA) incorporating a model that accounts for known sources of experimental variability can significantly improve the determination of differences in gene expression and estimations of their significance. Results: To realize the full benefits of performing analysis of variance on microarray data we have developed CARMA, a microarray analysis platform that reads data files generated by most microarray image processing software packages, performs ANOVA using a user- defined linear model, and produces easily interpretable graphical and numeric results. No pre- processing of the data is required and user- specified parameters control most aspects of the analysis including statistical significance criterion. The software also performs location and intensity dependent lowess normalization, automatic outlier detection and removal, and accommodates missing data. Conclusion: CARMA provides a clear quantitative and statistical characterization of each measured gene that can be used to assess marginally acceptable measures and improve confidence in the interpretation of microarray results. Overall, applying CARMA to microarray datasets incorporating repeated measures effectively reduces the number of gene incorrectly identified as differentially expressed and results in a more robust and reliable analysis.
引用
收藏
页数:13
相关论文
共 39 条
[1]   A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes [J].
Baldi, P ;
Long, AD .
BIOINFORMATICS, 2001, 17 (06) :509-519
[2]  
BATES D, 2005, R ENV
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[5]  
CUI X, 2002, VOL CAMDA 02 M DURH
[6]  
Cui X., 2003, STAT APPL GENETICS M, V2
[7]   Improved statistical tests for differential gene expression by shrinking variance components estimates [J].
Cui, XG ;
Hwang, JTG ;
Qiu, J ;
Blades, NJ ;
Churchill, GA .
BIOSTATISTICS, 2005, 6 (01) :59-75
[8]  
DeRisi J, 1996, NAT GENET, V14, P457
[9]   The effect of normalization on microarray data analysis [J].
Ding, YY ;
Wilkins, D .
DNA AND CELL BIOLOGY, 2004, 23 (10) :635-642
[10]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)