Graemlin: General and robust alignment of multiple large interaction networks

被引:203
作者
Flannick, Jason
Novak, Antal
Srinivasan, Balaji S.
McAdams, Harley H.
Batzoglou, Serafim [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Dev Biol, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
关键词
D O I
10.1101/gr.5235706
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The recent proliferation of protein interaction networks has motivated research into network alignment: the cross-species comparison of conserved functional modules. Previous studies have laid the foundations for such comparisons and demonstrated their power on a select set of sparse interaction networks. Recently, however, new computational techniques have produced hundreds of predicted interaction networks with interconnection densities that push existing alignment algorithms to their limits. To find conserved functional modules in these new networks, we have developed Gr ae mlin, the first algorithm capable of scalable multiple network alignment. Gr ae mlin's explicit model of functional evolution allows both the generalization of existing alignment scoring schemes and the location of conserved network topologies other than protein complexes and metabolic pathways. To assess Gr ae mlin's performance, we have developed the first quantitative benchmarks for network alignment, which allow comparisons of algorithms in terms of their ability to recapitulate the KEGG database of conserved functional modules. We find that Gr ae mlin achieves substantial scalability gains over previous methods while improving sensitivity.
引用
收藏
页码:1169 / 1181
页数:13
相关论文
共 68 条
  • [11] Ultraconserved elements in the human genome
    Bejerano, G
    Pheasant, M
    Makunin, I
    Stephen, S
    Kent, WJ
    Mattick, JS
    Haussler, D
    [J]. SCIENCE, 2004, 304 (5675) : 1321 - 1325
  • [12] Aligning multiple genomic sequences with the threaded blockset aligner
    Blanchette, M
    Kent, WJ
    Riemer, C
    Elnitski, L
    Smit, AFA
    Roskin, KM
    Baertsch, R
    Rosenbloom, K
    Clawson, H
    Green, ED
    Haussler, D
    Miller, W
    [J]. GENOME RESEARCH, 2004, 14 (04) : 708 - 715
  • [13] Apparent dependence of protein evolutionary rate on number of interactions is linked to biases in protein-protein interactions data sets
    Bloom, JD
    Adami, C
    [J]. BMC EVOLUTIONARY BIOLOGY, 2003, 3 (1)
  • [14] GO::TermFinder - open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes
    Boyle, EI
    Weng, SA
    Gollub, J
    Jin, H
    Botstein, D
    Cherry, JM
    Sherlock, G
    [J]. BIOINFORMATICS, 2004, 20 (18) : 3710 - 3715
  • [15] MAVID: Constrained ancestral alignment of multiple sequences
    Bray, N
    Pachter, L
    [J]. GENOME RESEARCH, 2004, 14 (04) : 693 - 699
  • [16] LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA
    Brudno, M
    Do, CB
    Cooper, GM
    Kim, MF
    Davydov, E
    Green, ED
    Sidow, A
    Batzoglou, S
    [J]. GENOME RESEARCH, 2003, 13 (04) : 721 - 731
  • [17] Chiaromonte F, 2002, Pac Symp Biocomput, P115
  • [18] Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms
    Christie, KR
    Weng, S
    Balakrishnan, R
    Costanzo, MC
    Dolinski, K
    Dwight, SS
    Engel, SR
    Feierbach, B
    Fisk, DG
    Hirschman, JE
    Hong, EL
    Issel-Tarver, L
    Nash, R
    Sethuraman, A
    Starr, B
    Theesfeld, CL
    Andrada, R
    Binkley, G
    Dong, Q
    Lane, C
    Schroeder, M
    Botstein, D
    Cherry, JM
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D311 - D314
  • [19] Thirty years of graph matching in pattern recognition
    Conte, D
    Foggia, P
    Sansone, C
    Vento, M
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (03) : 265 - 298
  • [20] Distribution and intensity of constraint in mammalian genomic sequence
    Cooper, GM
    Stone, EA
    Asimenos, G
    Green, ED
    Batzoglou, S
    Sidow, A
    [J]. GENOME RESEARCH, 2005, 15 (07) : 901 - 913