Mining biological networks for unknown pathways

被引:19
作者
Cakmak, Ali [1 ]
Ozsoyoglu, Gultekin [1 ]
机构
[1] Case Western Reserve Univ, Dept Elect Engn & Comp Sci, Cleveland, OH 44106 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btm409
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Biological pathways provide significant insights on the interaction mechanisms of molecules. Presently, many essential pathways still remain unknown or incomplete for newly sequenced organisms. Moreover, experimental validation of enormous numbers of possible pathway candidates in a wet-lab environment is time-and effort-extensive. Thus, there is a need for comparative genomics tools that help scientists predict pathways in an organism's biological network. Results: In this article, we propose a technique to discover unknown pathways in organisms. Our approach makes in- depth use of Gene Ontology (GO)-based functionalities of enzymes involved in metabolic pathways as follows: (i) Model each pathway as a biological functionality graph of enzyme GO functions, which we call pathway functionality template. (ii) Locate frequent pathway functionality patterns so as to infer previously unknown pathways through pattern matching in metabolic networks of organisms. We have experimentally evaluated the accuracy of the presented technique for 30 bacterial organisms to predict around 1500 organism-specific versions of 50 reference pathways. Using cross-validation strategy on known pathways, we have been able to infer pathways with 86% precision and 72% recall for enzymes (i.e. nodes). The accuracy of the predicted enzyme relationships has been measured at 85% precision with 64% recall.
引用
收藏
页码:2775 / 2783
页数:9
相关论文
共 46 条
[1]  
[Anonymous], 2002, INT C DAT MIN
[2]  
BANG JW, 2003, P WORKSH QUAL MOD BA
[3]   Reconstruction of amino acid biosynthesis pathways from the complete genome sequence [J].
Bono, H ;
Ogata, H ;
Goto, S ;
Kanehisa, M .
GENOME RESEARCH, 1998, 8 (03) :203-210
[4]  
CAKMAK A, 2007, MINING BIOL NETWORKS
[5]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[6]   A signaling mucin at the head of the Cdc42- and MAPK-dependent filamentous growth pathway in yeast [J].
Cullen, PJ ;
Sabbagh, W ;
Graham, E ;
Irick, MM ;
van Olden, EK ;
Neal, C ;
Delrow, J ;
Bardwell, L ;
Sprague, GF .
GENES & DEVELOPMENT, 2004, 18 (14) :1695-1708
[7]   Comparative genome analysis and pathway reconstruction [J].
Dandekar, T ;
Sauerborn, R .
PHARMACOGENOMICS, 2002, 3 (02) :245-256
[8]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[9]  
G. O. Consortium, 2004, Nucleic Acids Res, V32, pD258, DOI DOI 10.1093/NAR/GKH036
[10]   A Bayesian method for identifying missing enzymes in predicted metabolic pathway databases [J].
Green, ML ;
Karp, PD .
BMC BIOINFORMATICS, 2004, 5 (1)