PIBASE: a comprehensive database of structurally defined protein interfaces

Cited by: 131
Authors
Davis, FP
Sali, A [1]
Affiliations
[1] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[2] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Grad Grp Biophys, San Francisco, CA 94143 USA
[3] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
Funding
US National Science Foundation;
Keywords
DOI
10.1093/bioinformatics/bti277
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Motivation: In recent years, the Protein Data Bank (PDB) has experienced rapid growth. To maximize the utility of the high resolution protein-protein interaction data stored in the PDB, we have developed PIBASE, a comprehensive relational database of structurally defined interfaces between pairs of protein domains. It is composed of binary interfaces extracted from structures in the PDB and the Probable Quaternary Structure server using domain assignments from the Structural Classification of Proteins and CATH fold classification systems.
Results: PIBASE currently contains 158 915 interacting domain pairs between 105 061 domains from 2125 SCOP families. A diverse set of geometric, physicochemical and topological properties is calculated for each complex, its domains, interfaces and binding sites. A subset of the interface properties is used to remove interface redundancy within PDB entries, resulting in 20 912 distinct domain-domain interfaces. The complexes are grouped into 989 topological classes based on their patterns of domain-domain contacts. The binary interfaces and their corresponding binding sites are categorized into 18 755 and 30 975 topological classes, respectively, based on the topology of secondary structure elements. The utility of the database is illustrated by outlining several current applications.
Pages: 1901-1907
Page count: 7