Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs

被引:342
作者
Kashtan, N
Itzkovitz, S
Milo, R
Alon, U [1 ]
机构
[1] Weizmann Inst Sci, Dept Mol Cell Biol, IL-76100 Rehovot, Israel
[2] Weizmann Inst Sci, Dept Phys Complex Syst, IL-76100 Rehovot, Israel
[3] Weizmann Inst Sci, Dept Comp Sci & Appl Math, IL-76100 Rehovot, Israel
基金
以色列科学基金会;
关键词
D O I
10.1093/bioinformatics/bth163
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Biological and engineered networks have recently been shown to display network motifs: a small set of characteristic patterns that occur much more frequently than in randomized networks with the same degree sequence. Network motifs were demonstrated to play key information processing roles in biological regulation networks. Existing algorithms for detecting network motifs act by exhaustively enumerating all subgraphs with a given number of nodes in the network. The runtime of such algorithms increases strongly with network size. Here, we present a novel algorithm that allows estimation of subgraph concentrations and detection of network motifs at a runtime that is asymptotically independent of the network size. This algorithm is based on random sampling of subgraphs. Network motifs are detected with a surprisingly small number of samples in a wide variety of networks. Our method can be applied to estimate the concentrations of larger subgraphs in larger networks than was previously possible with exhaustive enumeration algorithms. We present results for high-order motifs in several biological networks and discuss their possible functions.
引用
收藏
页码:1746 / 1758
页数:13
相关论文
共 47 条
[1]  
ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
[2]  
Akkoyunlu E. A., 1973, SIAM Journal on Computing, V2, P1, DOI 10.1137/0202001
[3]   THE ALGORITHMIC ASPECTS OF THE REGULARITY LEMMA [J].
ALON, N ;
DUKE, RA ;
LEFMANN, H ;
RODL, V ;
YUSTER, R .
JOURNAL OF ALGORITHMS, 1994, 16 (01) :80-109
[4]   Finding and counting given length cycles [J].
Alon, N ;
Yuster, R ;
Zwick, U .
ALGORITHMICA, 1997, 17 (03) :209-223
[5]  
[Anonymous], UNIFORM GENERATION R
[6]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[7]  
BEZEM GJ, 1987, THESIS U UTRECHT UTR
[8]  
BOSILJKA T, 2002, ADV COMPLEX SYST, V5, P445
[9]   ESTIMATING THE NUMBER OF SPECIES - A REVIEW [J].
BUNGE, J ;
FITZPATRICK, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :364-373
[10]  
CHAUDHURI S, 1998, P ACM SIGMOD INT C M, P436