The other 90% of the protein: Assessment beyond the Cαs for CASP8 template-based and high-accuracy models

被引:60
作者
Keedy, Daniel A. [1 ]
Williams, Christopher J. [1 ]
Headd, Jeffrey J. [1 ,2 ]
Arendall, W. Bryan, III [1 ]
Chen, Vincent B. [1 ]
Kapral, Gary J. [1 ]
Gillespie, Robert A. [1 ]
Block, Jeremy N. [1 ]
Zemla, Adam [3 ]
Richardson, David C. [1 ]
Richardson, Jane S. [1 ]
机构
[1] Duke Univ, Med Ctr, Dept Biochem, Durham, NC 27710 USA
[2] Duke Univ, Computat Biol & Bioinformat Program, Durham, NC 27708 USA
[3] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
基金
美国国家卫生研究院;
关键词
homology modeling; protein structure prediction; all-atom contacts; full-model assessment; STRUCTURE VALIDATION; ATOM CONTACTS; PREDICTIONS;
D O I
10.1002/prot.22551
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
For template-based modeling in the CASP8 Critical Assessment of Techniques for Protein Structure Prediction, this work develops and applies six new full-model metrics. They are designed to complement and add value to the traditional template-based assessment by the global distance test (GDT) and related scores (based on multiple superpositions of C alpha atoms between target structure and predictions labeled "Model 1"). The new metrics evaluate each predictor group on each target, using all atoms of their best model with above-average GDT. Two metrics evaluate how "protein-like" the predicted model is: the MolProbity score used for validating experimental structures, and a mainchain reality score using all-atom steric clashes, bond length and angle outliers, and backbone dihedrals. Four other new metrics evaluate match of model to target for mainchain and side-chain hydrogen bonds, side-chain end positioning, and side-chain rotamers. Group-average Z-score across the six full-model measures is averaged with group-average GDT Z-score to produce the overall ranking for full-model, high-accuracy performance. Separate assessments are reported for specific aspects of predictor-group performance, such as robustness of approximately correct template or fold identification, and self-scoring ability at identifying the best of their models. Fold identification is distinct from but correlated with group-average GDT Z-score if target difficulty is taken into account, whereas self-scoring is done best by servers and is uncorrelated with GDT performance. Outstanding individual models on specific targets are identified and discussed. Predictor groups excelled at different aspects, highlighting the diversity of current methodologies. However, good full-model scores correlate robustly with high C alpha accuracy.
引用
收藏
页码:29 / 49
页数:21
相关论文
共 38 条
[1]  
Arendall W. Bryan III, 2005, Journal of Structural and Functional Genomics, V6, P1, DOI 10.1007/s10969-005-3138-4
[2]   Assessment of CASP8 structure predictions for template free targets [J].
Ben-David, Moshe ;
Noivirt-Brik, Orly ;
Paz, Aviv ;
Prilusky, Jaime ;
Sussman, Joel L. ;
Levy, Yaakov .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :50-65
[3]   The Protein Data Bank and the challenge of structural genomics [J].
Berman, HM ;
Bhat, TN ;
Bourne, PE ;
Feng, ZK ;
Gilliland, G ;
Weissig, H ;
Westbrook, J .
NATURE STRUCTURAL BIOLOGY, 2000, 7 (Suppl 11) :957-959
[4]   KinImmerse: Macromolecular VR for NMR ensembles [J].
Block, Jeremy N. ;
Zielinski, David J. ;
Chen, Vincent B. ;
Davis, Ian W. ;
Vinson, Claire ;
Brady, Rachael ;
Richardson, Jane S. ;
Richardson, David C. .
SOURCE CODE FOR BIOLOGY AND MEDICINE, 2009, 4 (01)
[5]   A graph-theory algorithm for rapid protein side-chain prediction [J].
Canutescu, AA ;
Shelenkov, AA ;
Dunbrack, RL .
PROTEIN SCIENCE, 2003, 12 (09) :2001-2014
[6]   Evaluation of template-based models in CASP8 with standard measures [J].
Cozzetto, Domenico ;
Kryshtafovych, Andriy ;
Fidelis, Krzysztof ;
Moult, John ;
Rost, Burkhard ;
Tramontano, Anna .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :18-28
[7]   MolProbity: all-atom contacts and structure validation for proteins and nucleic acids [J].
Davis, Ian W. ;
Leaver-Fay, Andrew ;
Chen, Vincent B. ;
Block, Jeremy N. ;
Kapral, Gary J. ;
Wang, Xueyi ;
Murray, Laura W. ;
Arendall, W. Bryan, III ;
Snoeyink, Jack ;
Richardson, Jane S. ;
Richardson, David C. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W375-W383
[8]  
Dill K., 2002, Molecular Driving Forces: Statistical Thermodynamics in Chemistry Biology
[9]   ACCURATE BOND AND ANGLE PARAMETERS FOR X-RAY PROTEIN-STRUCTURE REFINEMENT [J].
ENGH, RA ;
HUBER, R .
ACTA CRYSTALLOGRAPHICA SECTION A, 1991, 47 :392-400
[10]   Solution NMR structures of IgG binding domains with artificially evolved high levels of sequence identity but different folds [J].
He, YN ;
Yeh, DC ;
Alexander, P ;
Bryan, PN ;
Orban, J .
BIOCHEMISTRY, 2005, 44 (43) :14055-14061