Evaluation of model quality predictions in CASP9

被引:63
作者
Kryshtafovych, Andriy [1 ]
Fidelis, Krzysztof [1 ]
Tramontano, Anna [2 ]
机构
[1] Univ Calif Davis, Genome Ctr, Davis, CA 95616 USA
[2] Univ Roma La Sapienza, Dept Phys, I-00185 Rome, Italy
关键词
CASP; QA; model quality assessment; protein structure modeling; protein structure prediction; OPERATING CHARACTERISTIC CURVES; SCORING FUNCTION; PROTEIN MODELS; RECOGNITION; TARGET; SERVER; COMPLEXES; PCONS;
D O I
10.1002/prot.23180
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
CASP has been assessing the state of the art in the a priori estimation of accuracy of protein structure prediction since 2006. The inclusion of model quality assessment category in CASP contributed to a rapid development of methods in this area. In the last experiment, 46 quality assessment groups tested their approaches to estimate the accuracy of protein models as a whole and/or on a per-residue basis. We assessed the performance of these methods predominantly on the basis of the correlation between the predicted and observed quality of the models on both global and local scales. The ability of the methods to identify the models closest to the best one, to differentiate between good and bad models, and to identify well modeled regions was also analyzed. Our evaluations demonstrate that even though global quality assessment methods seem to approach perfection point (weighted average per-target Pearson's correlation coefficients are as high as 0.97 for the best groups), there is still room for improvement. First, all top-performing methods use consensus approaches to generate quality estimates, and this strategy has its own limitations. Second, the methods that are based on the analysis of individual models lag far behind clustering techniques and need a boost in performance. The methods for estimating per-residue accuracy of models are less accurate than global quality assessment methods, with an average weighted per-model correlation coefficient in the range of 0.63-0.72 for the best 10 groups. Proteins 2011; 79(Suppl 10): 91-106. (C) 2011 Wiley-Liss, Inc.
引用
收藏
页码:91 / 106
页数:16
相关论文
共 59 条
[1]   Distantly related lipocalins share two conserved clusters of hydrophobic residues: use in homology modeling [J].
Adam, Benoit ;
Charloteaux, Benoit ;
Beaufays, Jerome ;
Vanhamme, Luc ;
Godfroid, Edmond ;
Brasseur, Robert ;
Lins, Laurence .
BMC STRUCTURAL BIOLOGY, 2008, 8
[2]  
Adamczak R, 2011, J COMPUT BIOL 0118
[3]  
[Anonymous], J PROTEOME RES
[4]  
[Anonymous], ELECT STAT TXB
[5]   QMEAN: A comprehensive scoring function for model quality assessment [J].
Benkert, Pascal ;
Tosatto, Silvio C. E. ;
Schomburg, Dietmar .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) :261-277
[6]   Global and local model quality estimation at CASP8 using the scoring functions QMEAN and QMEANclust [J].
Benkert, Pascal ;
Tosatto, Silvio C. E. ;
Schwede, Torsten .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :173-180
[7]   QMEAN server for protein model quality estimation [J].
Benkert, Pascal ;
Kuenzli, Michael ;
Schwede, Torsten .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W510-W514
[8]   Comprehensive Structural and Functional Characterization of the Human Kinome by Protein Structure Modeling and Ligand Virtual Screening [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (10) :1839-1854
[9]   The PMDB Protein Model Database [J].
Castrignano, Tiziana ;
De Meo, Paolo D'Onorio ;
Cozzetto, Domenico ;
Talamo, Ivano Giuseppe ;
Tramontano, Anna .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D306-D309
[10]   Prediction of global and local quality of CASP8 models by MULTICOM series [J].
Cheng, Jianlin ;
Wang, Zheng ;
Tegge, Allison N. ;
Eickholt, Jesse .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :181-184