A scalable method for integration and functional analysis of multiple microarray datasets

被引:96
作者
Huttenhower, Curtis [1 ]
Hibbs, Matt [1 ]
Myers, Chad [1 ]
Troyanskaya, Olga G. [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Lewis Sinler Inst Integrat Genom, Princeton, NJ 08544 USA
关键词
D O I
10.1093/bioinformatics/btl492
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The diverse microarray datasets that have become available over the past several years represent a rich opportunity and challenge for biological data mining. Many supervised and unsupervised methods have been developed for the analysis of individual microarray datasets. However, integrated analysis of multiple datasets can provide a broader insight into genetic regulation of specific biological pathways under a variety of conditions. Results: To aid in the analysis of such large compendia of microarray experiments, we present Microarray Experiment Functional Integration Technology (MEFIT), a scalable Bayesian framework for predicting functional relationships from integrated microarray datasets. Furthermore, MEFIT predicts these functional relationships within the context of specific biological processes. All results are provided in the context of one or more specific biological functions, which can be provided by a biologist or drawn automatically from catalogs such as the Gene Ontology (GO). Using MEFIT, we integrated 40 Saccharomyces cerevisiae microarray datasets spanning 712 unique conditions. In tests based on 110 biological functions drawn from the GO biological process ontology, MEFIT provided a 5% or greater performance increase for 54 functions, with a 5% or more decrease in performance in only two functions.
引用
收藏
页码:2890 / 2897
页数:8
相关论文
共 60 条
[1]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[2]   A Rsc3/Rsc30 zinc cluster dimer reveals novel roles for the chromatin remodeler RSC in gene expression and cell cycle control [J].
Angus-Hill, ML ;
Schlichter, A ;
Roberts, D ;
Erdjument-Bromage, H ;
Tempst, P ;
Cairns, BR .
MOLECULAR CELL, 2001, 7 (04) :741-751
[3]  
[Anonymous], 2000, ISMB
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   Identifying differentially expressed genes in cDNA microarray experiments [J].
Baggerly, KA ;
Coombes, KR ;
Hess, KR ;
Stivers, DN ;
Abruzzo, LV ;
Zhang, W .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) :639-659
[6]  
Ball CA, 2005, NUCLEIC ACIDS RES, V33, pD580
[7]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[8]   Iterative signature algorithm for the analysis of large-scale gene expression data [J].
Bergmann, S ;
Ihmels, J ;
Barkai, N .
PHYSICAL REVIEW E, 2003, 67 (03) :18
[9]   Expression profiling of the schizont and trophozoite stages of Plasmodium falciparum with a long-oligonucleotide microarray -: art. no. R9 [J].
Bozdech, Z ;
Zhu, JC ;
Joachimiak, MP ;
Cohen, FE ;
Pulliam, B ;
DeRisi, JL .
GENOME BIOLOGY, 2003, 4 (02)
[10]   The landscape of genetic complexity across 5,700 gene expression traits in yeast [J].
Brem, RB ;
Kruglyak, L .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (05) :1572-1577