PIBASE: a comprehensive database of structurally defined protein interfaces

Cited by: 131
Authors
Davis, FP
Sali, A [1]
Affiliations
[1] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Grad Grp Biophys, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
Funding
US National Science Foundation
Keywords
DOI
10.1093/bioinformatics/bti277
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010 ; 081704 ;
Abstract
Motivation: In recent years, the Protein Data Bank (PDB) has experienced rapid growth. To maximize the utility of the high-resolution protein-protein interaction data stored in the PDB, we have developed PIBASE, a comprehensive relational database of structurally defined interfaces between pairs of protein domains. It is composed of binary interfaces extracted from structures in the PDB and the Probable Quaternary Structure server using domain assignments from the Structural Classification of Proteins (SCOP) and CATH fold classification systems.
Results: PIBASE currently contains 158 915 interacting domain pairs between 105 061 domains from 2125 SCOP families. A diverse set of geometric, physicochemical and topological properties is calculated for each complex, its domains, interfaces and binding sites. A subset of the interface properties is used to remove interface redundancy within PDB entries, resulting in 20 912 distinct domain-domain interfaces. The complexes are grouped into 989 topological classes based on their patterns of domain-domain contacts. The binary interfaces and their corresponding binding sites are categorized into 18 755 and 30 975 topological classes, respectively, based on the topology of secondary structure elements. The utility of the database is illustrated by outlining several current applications.
Pages: 1901-1907
Page count: 7
Related papers
52 in total
[1]   LGL: Creating a map of protein function with an algorithm for visualizing very large biological networks [J].
Adai, AT ;
Date, SV ;
Wieland, S ;
Marcotte, EM .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 340 (01) :179-190
[2]   The cell as a collection of protein machines: Preparing the next generation of molecular biologists [J].
Alberts, B .
CELL, 1998, 92 (03) :291-294
[3]   The relationship between sequence and interaction divergence in proteins [J].
Aloy, P ;
Ceulemans, H ;
Stark, A ;
Russell, RB .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (05) :989-998
[4]   Interrogating protein interaction networks through structural biology [J].
Aloy, P ;
Russell, RB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (09) :5896-5901
[5]   AN INVESTIGATION OF PROTEIN SUBUNIT AND DOMAIN INTERFACES [J].
ARGOS, P .
PROTEIN ENGINEERING, 1988, 2 (02) :101-113
[6]   Bader GD, 2003, NUCLEIC ACIDS RES, V31, P248, DOI 10.1093/nar/gkg056
[7]   BERG M, 1998, COMPUTATIONAL GEOMET
[8]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[9]   Are protein-protein interfaces more conserved in sequence than the rest of the protein surface? [J].
Caffrey, DR ;
Somaroo, S ;
Hughes, JD ;
Mintseris, J ;
Huang, ES .
PROTEIN SCIENCE, 2004, 13 (01) :190-202
[10]   Carugo O, 1997, PROTEIN SCI, V6, P2261