Interpretation of ANOVA models for microarray data using PCA

被引:52
作者
de Haan, J. R.
Wehrens, R.
Bauerschmidt, S.
Piek, E.
van Schaik, R. C.
Buydens, L. M. C.
机构
[1] Radboud Univ Nijmegen, Inst Mol & Mat, NL-6525 ED Nijmegen, Netherlands
[2] NV Organon, NL-5340 BH Oss, Netherlands
[3] Radboud Univ Nijmegen, Dept Appl Biol, NL-6525 ED Nijmegen, Netherlands
[4] Radboud Univ Nijmegen, Ctr Mol & Biomol Informat, Nijmegen Ctr Mol Life Sci, NL-6525 ED Nijmegen, Netherlands
关键词
D O I
10.1093/bioinformatics/btl572
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: ANOVA is a technique, which is frequently used in the analysis of microarray data, e.g. to assess the significance of treatment effects, and to select interesting genes based on P-values. However, it does not give information about what exactly is causing the effect. Our purpose is to improve the interpretation of the results from ANOVA on large microarray datasets, by applying PCA on the individual variance components. Interaction effects can be visualized by biplots, showing genes and variables in one plot, providing insight in the effect of e.g. treatment or time on gene expression. Because ANOVA has removed uninteresting sources of variance, the results are much more interpretable than without ANOVA. Moreover, the combination of ANOVA and PCA provides a simple way to select genes, based on the interactions of interest. Results: It is shown that the components from an ANOVA model can be summarized and visualized with PCA, which improves the interpretability of the models. The method is applied to a real time-course gene expression dataset of mesenchymal stem cells. The dataset was designed to investigate the effect of different treatments on osteogenesis. The biplots generated with the algorithm give specific information about the effects of specific treatments on genes over time. These results are in agreement with the literature. The biological validation with GO annotation from the genes present in the selections shows that biologically relevant groups of genes are selected.
引用
收藏
页码:184 / 190
页数:7
相关论文
共 30 条
[21]   Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays [J].
Pan, KH ;
Lih, CJ ;
Cohen, SN .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (25) :8961-8965
[22]   Using ANOVA for gene selection from microarray studies of the nervous system [J].
Pavlidis, P .
METHODS, 2003, 31 (04) :282-289
[23]  
Raychaudhuri S, 2000, Pac Symp Biocomput, P455
[24]   ANOVA-simultaneous component analysis (ASCA): a new tool for analyzing designed metabolomics data [J].
Smilde, AK ;
Jansen, JJ ;
Hoefsloot, HCJ ;
Lamers, RJAN ;
van der Greef, J ;
Timmerman, ME .
BIOINFORMATICS, 2005, 21 (13) :3043-3048
[25]   Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization [J].
Spellman, PT ;
Sherlock, G ;
Zhang, MQ ;
Iyer, VR ;
Anders, K ;
Eisen, MB ;
Brown, PO ;
Botstein, D ;
Futcher, B .
MOLECULAR BIOLOGY OF THE CELL, 1998, 9 (12) :3273-3297
[26]   Evaluation of gene expression measurements from commercial microarray platforms [J].
Tan, PK ;
Downey, TJ ;
Spitznagel, EL ;
Xu, P ;
Fu, D ;
Dimitrov, DS ;
Lempicki, RA ;
Raaka, BM ;
Cam, MC .
NUCLEIC ACIDS RESEARCH, 2003, 31 (19) :5676-5684
[27]  
Tian Hua, 2003, Beijing Da Xue Xue Bao Yi Xue Ban, V35, P317
[28]   Glucocorticoid-induced increase in lymphocytic FKBP51 messenger ribonucleic acid expression: A potential marker for glucocorticoid sensitivity, potency, and bioavailability [J].
Vermeer, H ;
Hendriks-Stegeman, BI ;
van der Burg, B ;
van Buul-Offers, SC ;
Jansen, M .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2003, 88 (01) :277-284
[29]   Rosetta error model for gene expression analysis [J].
Weng, L ;
Dai, HY ;
Zhan, YH ;
He, YD ;
Stepaniants, SB ;
Bassett, DE .
BIOINFORMATICS, 2006, 22 (09) :1111-1121
[30]   Assessing gene significance from cDNA microarray expression data via mixed models [J].
Wolfinger, RD ;
Gibson, G ;
Wolfinger, ED ;
Bennett, L ;
Hamadeh, H ;
Bushel, P ;
Afshari, C ;
Paules, RS .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) :625-637