A search engine to identify pathway genes from expression data on multiple organisms

被引:6
作者
Chen, Chunnuan
Weirauch, Matthew T.
Powell, Corey C.
Zambon, Alexander C.
Stuart, Joshua M. [1 ]
机构
[1] Univ Calif Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[2] Gladstone Inst Cardiovasc Dis, Dept Med, San Francisco, CA 94158 USA
关键词
D O I
10.1186/1752-0509-1-20
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The completion of several genome projects showed that most genes have not yet been characterized, especially in multicellular organisms. Although most genes have unknown functions, a large collection of data is available describing their transcriptional activities under many different experimental conditions. In many cases, the coregulatation of a set of genes across a set of conditions can be used to infer roles for genes of unknown function. Results: We developed a search engine, the Multiple-Species Gene Recommender (MSGR), which scans gene expression datasets from multiple organisms to identify genes that participate in a genetic pathway. The MSGR takes a query consisting of a list of genes that function together in a genetic pathway from one of six organisms: Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana, and Helicobacter pylori. Using a probabilistic method to merge searches, the MSGR identifies genes that are significantly coregulated with the query genes in one or more of those organisms. The MSGR achieves its highest accuracy for many human pathways when searches are combined across species. We describe specific examples in which new genes were identified to be involved in a neuromuscular signaling pathway and a cell-adhesion pathway. Conclusion: The search engine can scan large collections of gene expression data for new genes that are significantly coregulated with a pathway of interest. By integrating searches across organisms, the MSGR can identify pathway members whose coregulation is either ancient or newly evolved.
引用
收藏
页数:19
相关论文
共 45 条
[1]   Transcription factor Egr-1 activates collagen expression in immortalized fibroblasts or fibrosarcoma cells [J].
Alexander, D ;
Judex, M ;
Meyringer, R ;
Weis-Klemm, M ;
Gay, S ;
Müller-Ladner, U ;
Aicher, WK .
BIOLOGICAL CHEMISTRY, 2002, 383 (12) :1845-1853
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   The role of Zic genes in neural development [J].
Aruga, J .
MOLECULAR AND CELLULAR NEUROSCIENCE, 2004, 26 (02) :205-221
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   The MAZ protein is an autoantigen of Hodgkin's disease and paraneoplastic cerebellar dysfunction [J].
Bataller, L ;
Wade, DF ;
Graus, F ;
Rosenfeld, MR ;
Dalmau, J .
ANNALS OF NEUROLOGY, 2003, 53 (01) :123-127
[6]   Spectrin αII and βII isoforms interact with high affinity at the tetramerization site [J].
Bignone, PA ;
Baines, AJ .
BIOCHEMICAL JOURNAL, 2003, 374 :613-624
[7]  
CHENG Y, 2000, P 8 INT C INT SYST M, P93
[8]   Presynaptic calcium stores and synaptic transmission [J].
Collin, T ;
Marty, A ;
Llano, I .
CURRENT OPINION IN NEUROBIOLOGY, 2005, 15 (03) :275-281
[9]   Deubiquitinating enzymes: A new class of biological regulators [J].
D'Andrea, A ;
Pellman, D .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1998, 33 (05) :337-352
[10]   GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways [J].
Dahlquist, KD ;
Salomonis, N ;
Vranizan, K ;
Lawlor, SC ;
Conklin, BR .
NATURE GENETICS, 2002, 31 (01) :19-20