CAD-score: A new contact area difference-based function for evaluation of protein structural models

被引:103
作者
Olechnovic, Kliment [1 ]
Kulberkyte, Eleonora [1 ]
Venclovas, Ceslovas [1 ]
机构
[1] Vilnius State Univ, Inst Biotechnol, LT-02241 Vilnius, Lithuania
关键词
protein structure; model accuracy; residue-residue contacts; physical realism; domain rearrangement; multi-subunit structures; CASP; GDT; STRUCTURE PREDICTIONS; CASP8; ACCURACY;
D O I
10.1002/prot.24172
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Evaluation of protein models against the native structure is essential for the development and benchmarking of protein structure prediction methods. Although a number of evaluation scores have been proposed to date, many aspects of model assessment still lack desired robustness. In this study we present CAD-score, a new evaluation function quantifying differences between physical contacts in a model and the reference structure. The new score uses the concept of residueresidue contact area difference (CAD) introduced by Abagyan and Totrov (J Mol Biol 1997; 268:678685). Contact areas, the underlying basis of the score, are derived using the Voronoi tessellation of protein structure. The newly introduced CAD-score is a continuous function, confined within fixed limits, free of any arbitrary thresholds or parameters. The built-in logic for treatment of missing residues allows consistent ranking of models of any degree of completeness. We tested CAD-score on a large set of diverse models and compared it to GDT-TS, a widely accepted measure of model accuracy. Similarly to GDT-TS, CAD-score showed a robust performance on single-domain proteins, but displayed a stronger preference for physically more realistic models. Unlike GDT-TS, the new score revealed a balanced assessment of domain rearrangement, removing the necessity for different treatment of single-domain, multi-domain, and multi-subunit structures. Moreover, CAD-score makes it possible to assess the accuracy of inter-domain or inter-subunit interfaces directly. In addition, the approach offers an alternative to the superposition-based model clustering. The CAD-score implementation is available both as a web server and a standalone software package at http://www.ibt.lt/bioinformatics/cad-score/. Proteins 2013. (c) 2012 Wiley Periodicals, Inc.
引用
收藏
页码:149 / 162
页数:14
相关论文
共 30 条
[1]   Contact area difference (CAD): A robust measure to evaluate accuracy of protein models [J].
Abagyan, RA ;
Totrov, MM .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (03) :678-685
[2]   Predictions without templates: New folds, secondary structure, and contacts in CASP5 [J].
Aloy, P ;
Stark, A ;
Hadley, S ;
Russell, RB .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :436-456
[3]   Assessment of CASP8 structure predictions for template free targets [J].
Ben-David, Moshe ;
Noivirt-Brik, Orly ;
Paz, Aviv ;
Prilusky, Jaime ;
Sussman, Joel L. ;
Levy, Yaakov .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :50-65
[4]   MolProbity: all-atom structure validation for macromolecular crystallography [J].
Chen, Vincent B. ;
Arendall, W. Bryan, III ;
Headd, Jeffrey J. ;
Keedy, Daniel A. ;
Immormino, Robert M. ;
Kapral, Gary J. ;
Murray, Laura W. ;
Richardson, Jane S. ;
Richardson, David C. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2010, 66 :12-21
[5]   Evaluation of template-based models in CASP8 with standard measures [J].
Cozzetto, Domenico ;
Kryshtafovych, Andriy ;
Fidelis, Krzysztof ;
Moult, John ;
Rost, Burkhard ;
Tramontano, Anna .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :18-28
[6]   Assessment of CASP7 structure predictions for template free targets [J].
Jauch, Ralf ;
Yeo, Hock Chuan ;
Kolatkar, Prasanna R. ;
Clarke, Neil D. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :57-67
[7]   SOLUTION FOR BEST ROTATION TO RELATE 2 SETS OF VECTORS [J].
KABSCH, W .
ACTA CRYSTALLOGRAPHICA SECTION A, 1976, 32 (SEP1) :922-923
[8]   The other 90% of the protein: Assessment beyond the Cαs for CASP8 template-based and high-accuracy models [J].
Keedy, Daniel A. ;
Williams, Christopher J. ;
Headd, Jeffrey J. ;
Arendall, W. Bryan, III ;
Chen, Vincent B. ;
Kapral, Gary J. ;
Gillespie, Robert A. ;
Block, Jeremy N. ;
Zemla, Adam ;
Richardson, David C. ;
Richardson, Jane S. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :29-49
[9]   Euclidean Voronoi diagram of 3D balls and its computation via tracing edges [J].
Kim, DS ;
Cho, Y ;
Kim, D .
COMPUTER-AIDED DESIGN, 2005, 37 (13) :1412-1424
[10]   CASP9 target classification [J].
Kinch, Lisa N. ;
Shi, Shuoyong ;
Cheng, Hua ;
Cong, Qian ;
Pei, Jimin ;
Mariani, Valerio ;
Schwede, Torsten ;
Grishin, Nick V. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 :21-36