Visualisation and graph-theoretic analysis of a large-scale protein structural interactome

被引:20
作者
Bolser, D
Dafas, P
Harrington, R
Park, J
Schroeder, M [1 ]
机构
[1] City Univ London, Dept Comp, London EC1V 0HB, England
[2] MRC, Dunn Human Nutr Unit, Cambridge CB2 2XY, England
[3] Korea Adv Inst Sci & Technol, Dept Biosyst, Seoul, South Korea
关键词
D O I
10.1186/1471-2105-4-45
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Large-scale protein interaction maps provide a new, global perspective with which to analyse protein function. PSIMAP, the Protein Structural Interactome Map, is a database of all the structurally observed interactions between superfamilies of protein domains with known three-dimensional structure in the PDB. PSIMAP incorporates both functional and evolutionary information into a single network. Results: We present a global analysis of PSIMAP using several distinct network measures relating to centrality, interactivity, fault-tolerance, and taxonomic diversity. We found the following results: Centrality: we show that the center and barycenter of PSIMAP do not coincide, and that the superfamilies forming the barycenter relate to very general functions, while those constituting the center relate to enzymatic activity. Interactivity: we identify the P-loop and immunoglobulin superfamilies as the most highly interactive. We successfully use connectivity and cluster index, which characterise the connectivity of a superfamily's neighbourhood, to discover superfamilies of complex I and II. This is particularly significant as the structure of complex I is not yet solved. Taxonomic diversity: we found that highly interactive superfamilies are in general taxonomically very diverse and are thus amongst the oldest. Fault-tolerance: we found that the network is very robust as for the majority of superfamilies removal from the network will not break up the network. Conclusions: Overall, we can single out the P-loop containing nucleotide triphosphate hydrolases superfamily as it is the most highly connected and has the highest taxonomic diversity. In addition, this superfamily has the highest interaction rank, is the barycenter of the network (it has the shortest average path to every other superfamily in the network), and is an articulation vertex, whose removal will disconnect the network. More generally, we conclude that the graph-theoretic and taxonomic analysis of PSIMAP is an important step towards the understanding of protein function and could be an important tool for tracing the evolution of life at the molecular level.
引用
收藏
页数:22
相关论文
共 63 条
[1]  
ALEXANDROV NN, 1994, PROTEIN SCI, V3, P866
[2]   METABOLIC ABNORMALITIES IN COBALAMIN (VITAMIN-B(12) AND FOLATE-DEFICIENCY [J].
ALLEN, RH ;
STABLER, SP ;
SAVAGE, DG ;
LINDENBAUM, J .
FASEB JOURNAL, 1993, 7 (14) :1344-1353
[3]   Interrogating protein interaction networks through structural biology [J].
Aloy, P ;
Russell, RB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (09) :5896-5901
[4]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[5]   Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains [J].
Anantharaman, V ;
Koonin, EV ;
Aravind, L .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (05) :1271-1292
[6]   An automated method for finding molecular complexes in large protein interaction networks [J].
Bader, GD ;
Hogue, CW .
BMC BIOINFORMATICS, 2003, 4 (1)
[7]   BIND - a data specification for storing and describing biomolecular interactions, molecular complexes and pathways [J].
Bader, GD ;
Hogue, CWV .
BIOINFORMATICS, 2000, 16 (05) :465-477
[8]   DOMAIN SWAPPING - ENTANGLING ALLIANCES BETWEEN PROTEINS [J].
BENNETT, MJ ;
CHOE, S ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (08) :3127-3131
[9]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[10]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544