Protein interface classification by evolutionary analysis

Cited by: 115
Authors
Duarte, Jose M. [1]
Srebniak, Adam [2]
Schaerer, Martin A. [1]
Capitani, Guido [1]
Affiliations
[1] Paul Scherrer Inst, CH-5232 Villigen, Switzerland
[2] Swiss Fed Inst Technol, SyBIT, Zurich, Switzerland
Keywords
Protein structure; Protein-protein interfaces; Crystal interfaces; Classification; Evolutionary; Core residues; Web server; QUATERNARY STRUCTURE; KINASE DOMAIN; CONSERVATION; SEQUENCE; DATABASE; RECOGNITION; PREDICTION; GENERATION; MECHANISM; SUBSTRATE;
DOI
10.1186/1471-2105-13-334
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
070307 [Chemical Biology];
Abstract
Background: Distinguishing biologically relevant interfaces from lattice contacts in protein crystals is a fundamental problem in structural biology. Despite efforts towards the computational prediction of interface character, many issues are still unresolved.
Results: We present here a protein-protein interface classifier that relies on evolutionary data to detect the biological character of interfaces. The classifier uses a simple geometric measure, number of core residues, and two evolutionary indicators based on the sequence entropy of homolog sequences. Both aim at detecting differential selection pressure between interface core and rim or rest of surface. The core residues, defined as fully buried residues (>95% burial), appear to be fundamental determinants of biological interfaces: their number is in itself a powerful discriminator of interface character and together with the evolutionary measures it is able to clearly distinguish evolved biological contacts from crystal ones. We demonstrate that this definition of core residues leads to distinctively better results than earlier definitions from the literature. The stringent selection and quality filtering of structural and sequence data was key to the success of the method. Most importantly we demonstrate that a more conservative selection of homolog sequences - with relatively high sequence identities to the query - is able to produce a clearer signal than previous attempts.
Conclusions: An evolutionary approach like the one presented here is key to the advancement of the field, which so far was missing an effective method exploiting the evolutionary character of protein interfaces. Its coverage and performance will only improve over time thanks to the incessant growth of sequence databases. Currently our method reaches an accuracy of 89% in classifying interfaces of the Ponstingl 2003 datasets and it lends itself to a variety of useful applications in structural biology and bioinformatics.
We made the corresponding software implementation available to the community as an easy-to-use graphical web interface at http://www.eppic-web.org.
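The abstract names two ingredients: core residues, defined as more than 95% buried, and sequence entropy computed over homolog sequences, compared between interface core and rim. The following is a minimal Python sketch of that idea only, not the authors' implementation. The function names, the input layout (an alignment as equal-length strings, burial as a mapping from alignment column to buried fraction), the use of unweighted Shannon entropy over the raw amino-acid alphabet, and the plain ratio of means are all assumptions made for illustration; the abstract does not specify how the published scores or the final classification call are computed.

```python
import math
from collections import Counter

# Illustrative sketch; names and scoring details are assumptions,
# not taken from the published method.

def column_entropy(column):
    """Shannon entropy in bits of one alignment column, ignoring gaps."""
    residues = [aa for aa in column if aa != "-"]
    if not residues:
        return 0.0
    total = len(residues)
    counts = Counter(residues)
    return sum((n / total) * math.log2(total / n) for n in counts.values())

def split_core_rim(burial, threshold=0.95):
    """Split interface positions into core (> threshold buried) and rim.

    burial maps an alignment column index to the buried fraction (0..1)
    of the corresponding interface residue. The 0.95 default follows
    the >95% burial definition quoted in the abstract.
    """
    core = sorted(i for i, b in burial.items() if b > threshold)
    rim = sorted(i for i, b in burial.items() if b <= threshold)
    return core, rim

def core_rim_entropy_ratio(alignment, burial, threshold=0.95):
    """Mean core entropy divided by mean rim entropy (hypothetical score).

    Values below 1 would suggest the core is more conserved than the rim.
    Returns None when the ratio is undefined.
    """
    core, rim = split_core_rim(burial, threshold)
    if not core or not rim:
        return None

    def mean_entropy(positions):
        values = [column_entropy([seq[i] for seq in alignment]) for i in positions]
        return sum(values) / len(values)

    rim_mean = mean_entropy(rim)
    if rim_mean == 0:
        return None
    return mean_entropy(core) / rim_mean

# Toy data: four aligned homologs, four interface positions.
toy_alignment = ["ACDE", "ACDF", "ACEG", "ACDH"]
toy_burial = {0: 0.99, 1: 0.97, 2: 0.40, 3: 0.10}
toy_ratio = core_rim_entropy_ratio(toy_alignment, toy_burial)
```

In the toy data the two core columns are invariant while the rim columns vary, so the ratio comes out below 1. A real pipeline would also need the homolog selection and quality filtering that the abstract stresses, which this sketch leaves out.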
Pages: 16