Optimal contact definition for reconstruction of Contact Maps

被引:43
作者
Duarte, Jose M. [1 ,2 ]
Sathyapriya, Rajagopal [1 ]
Stehr, Henning [1 ]
Filippis, Ioannis [1 ,3 ]
Lappe, Michael [1 ]
机构
[1] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
[2] Paul Scherrer Inst, Lab Biomol Res, CH-5232 Villigen, Switzerland
[3] Univ London Imperial Coll Sci Technol & Med, Ctr Bioinformat, Div Mol Biosci, London SW7 2AZ, England
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
DISTANCE GEOMETRY; PROTEIN-STRUCTURE; CONFORMATIONS; ALIGNMENTS; POTENTIALS; PREDICTION; ALGORITHM;
D O I
10.1186/1471-2105-11-283
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Contact maps have been extensively used as a simplified representation of protein structures. They capture most important features of a protein's fold, being preferred by a number of researchers for the description and study of protein structures. Inspired by the model's simplicity many groups have dedicated a considerable amount of effort towards contact prediction as a proxy for protein structure prediction. However a contact map's biological interest is subject to the availability of reliable methods for the 3-dimensional reconstruction of the structure. Results: We use an implementation of the well-known distance geometry protocol to build realistic protein 3-dimensional models from contact maps, performing an extensive exploration of many of the parameters involved in the reconstruction process. We try to address the questions: a) to what accuracy does a contact map represent its corresponding 3D structure, b) what is the best contact map representation with regard to reconstructability and c) what is the effect of partial or inaccurate contact information on the 3D structure recovery. Our results suggest that contact maps derived from the application of a distance cutoff of 9 to 11 angstrom around the C-beta atoms constitute the most accurate representation of the 3D structure. The reconstruction process does not provide a single solution to the problem but rather an ensemble of conformations that are within 2 angstrom RMSD of the crystal structure and with lower values for the pairwise average ensemble RMSD. Interestingly it is still possible to recover a structure with partial contact information, although wrong contacts can lead to dramatic loss in reconstruction fidelity. Conclusions: Thus contact maps represent a valid approximation to the structures with an accuracy comparable to that of experimental methods. The optimal contact definitions constitute key guidelines for methods based on contact maps such as structure prediction through contacts and structural alignments based on maximum contact map overlap.
引用
收藏
页数:10
相关论文
共 43 条
[1]   PDP: protein domain parser [J].
Alexandrov, N ;
Shindyalov, I .
BIOINFORMATICS, 2003, 19 (03) :429-430
[2]  
[Anonymous], 1953, Theory and Applications of Distance Geometry
[3]   Homology modelling by distance geometry [J].
Aszodi, A ;
Taylor, WR .
FOLDING & DESIGN, 1996, 1 (05) :325-334
[4]  
Bartoli Lisa, 2008, V413, P199
[5]   QMEAN: A comprehensive scoring function for model quality assessment [J].
Benkert, Pascal ;
Tosatto, Silvio C. E. ;
Schomburg, Dietmar .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) :261-277
[6]   Residue contact-count potentials are as effective as residue-residue contact-type potentials for ranking protein decoys [J].
Bolser, Dan M. ;
Filippis, Ioannis ;
Stehr, Henning ;
Duarte, Jose ;
Lappe, Michael .
BMC STRUCTURAL BIOLOGY, 2008, 8
[7]   1001 optimal PDB structure alignments: Integer programming methods for finding the maximum contact map overlap [J].
Caprara, A ;
Carr, R ;
Istrail, S ;
Lancia, G ;
Walenz, B .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (01) :27-52
[8]   A three-state prediction of single point mutations on protein stability changes [J].
Capriotti, Emidio ;
Fariselli, Piero ;
Rossi, Ivan ;
Casadio, Rita .
BMC BIOINFORMATICS, 2008, 9
[9]   Fidelity of the protein structure reconstruction from inter-residue proximity constraints [J].
Chen, Yiwen ;
Ding, Feng ;
Dokholyan, Nikolay V. .
JOURNAL OF PHYSICAL CHEMISTRY B, 2007, 111 (25) :7432-7438
[10]   Improved residue contact prediction using support vector machines and a large feature set [J].
Cheng, Jianlin ;
Baldi, Pierre .
BMC BIOINFORMATICS, 2007, 8 (1)