Local graph alignment and motif search in biological networks

被引:123
作者
Berg, J [1 ]
Lässig, M [1 ]
机构
[1] Univ Cologne, Inst Theoret Phys, D-50937 Cologne, Germany
关键词
D O I
10.1073/pnas.0305199101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Interaction networks are of central importance in postgenomic molecular biology, with increasing amounts of data becoming available by high-throughput methods. Examples are gene regulatory networks or protein interaction maps. The main challenge in the analysis of these data is to read off biological functions from the topology of the network. Topological motifs, i.e., patterns occurring repeatedly at different positions in the network, have recently been identified as basic modules of molecular information processing. in this article, we discuss motifs derived from families of mutually similar but not necessarily identical patterns. We establish a statistical model for the occurrence of such motifs, from which we derive a scoring function for their statistical significance. Based on this scoring function, we develop a search algorithm for topological motifs called graph alignment, a procedure with some analogies to sequence alignment. The algorithm is applied to the gene regulation network of Escherichia coli.
引用
收藏
页码:14689 / 14694
页数:6
相关论文
共 21 条
[1]   Correlated random networks -: art. no. 228701 [J].
Berg, J ;
Lässig, M .
PHYSICAL REVIEW LETTERS, 2002, 89 (22) :228701-228701
[2]   Superparamagnetic clustering of data [J].
Blatt, M ;
Wiseman, S ;
Domany, E .
PHYSICAL REVIEW LETTERS, 1996, 76 (18) :3251-3254
[3]   Building a dictionary for genomes: Identification of presumptive regulatory sites by statistical analysis [J].
Bussemaker, HJ ;
Li, H ;
Siggia, ED .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10096-10100
[4]   Turnover of binding sites for transcription factors involved in early Drosophila development [J].
Costas, J ;
Casares, F ;
Vieira, J .
GENE, 2003, 310 :215-220
[5]  
Davidson E. H., 2001, Genomic regulatory systems: development and evolution
[6]  
Durbin R., 1998, Biological sequence analysis: Probabilistic models of proteins and nucleic acids
[7]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[8]  
Garey M. R., 1979, Computers and intractability. A guide to the theory of NP-completeness
[9]   Subgraphs in random networks [J].
Itzkovitz, S ;
Milo, R ;
Kashtan, N ;
Ziv, G ;
Alon, U .
PHYSICAL REVIEW E, 2003, 68 (02) :8
[10]   METHODS FOR ASSESSING THE STATISTICAL SIGNIFICANCE OF MOLECULAR SEQUENCE FEATURES BY USING GENERAL SCORING SCHEMES [J].
KARLIN, S ;
ALTSCHUL, SF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (06) :2264-2268