Model quality assessment using distance constraints from alignments

被引:18
作者
Paluszewski, Martin [2 ]
Karplus, Kevin [1 ]
机构
[1] Univ Calif Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[2] Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark
关键词
MQA; model quality assessment; distance constraints; protein structure prediction; metaserver; LOCAL-STRUCTURE; FOLD-RECOGNITION; PROTEIN; PREDICTION; SEQUENCE; TASSER;
D O I
10.1002/prot.22262
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Given a set of alternative models for a specific protein sequence, the model quality assessment (MQA) problem asks for an assignment of scores to each model in the set. A good MQA program assigns these scores such that they correlate well with real quality of the models, ideally scoring best that model which is closest to the true structure. In this article, we present a new approach for addressing the MQA problem. It is based on distance constraints extracted from alignments to templates of known structure, and is implemented in the Undertaker program for protein structure prediction. One novel feature is that we extract noncontact constraints as well as contact constraints. We describe how the distance constraint extraction is done and we show how they can be used to address the MQA problem. We have compared our method on CASP7 targets and the results show that our method is at least comparable with the best MQA methods that were assessed at CASP7. We also propose a new evaluation measure, Kendall's tau, that is more interpretable than conventional measures used for evaluating MQA methods (Pearson's r and Spearman's rho). We show clear examples where Kendall's r agrees much more with our intuition of a correct MQA, and we therefore propose that Kendall's T be used for future CASP MQA assessments.
引用
收藏
页码:540 / 549
页数:10
相关论文
共 22 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]  
ARCHIE J, 2009, STRUCT FUNCT BIOINFO, V75, P550
[4]   Free modeling with Rosetta in CASP6 [J].
Bradley, P ;
Malmström, L ;
Qian, B ;
Schonbrun, J ;
Chivian, D ;
Kim, DE ;
Meiler, K ;
Misura, KMS ;
Baker, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :128-134
[5]   A graph-theory algorithm for rapid protein side-chain prediction [J].
Canutescu, AA ;
Shelenkov, AA ;
Dunbrack, RL .
PROTEIN SCIENCE, 2003, 12 (09) :2001-2014
[6]   THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS [J].
CHOTHIA, C ;
LESK, AM .
EMBO JOURNAL, 1986, 5 (04) :823-826
[7]   Assessment of predictions in the model quality assessment category [J].
Cozzetto, Domenico ;
Kryshtafovych, Andriy ;
Ceriani, Michele ;
Tramontano, Anna .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :175-183
[8]   Hidden Markov models that use predicted local structure for fold recognition: Alphabets of backbone geometry [J].
Karchin, R ;
Cline, M ;
Mandel-Gutfreund, Y ;
Karplus, K .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 51 (04) :504-514
[9]   SAM-T04: What is new in protein-structure prediction for CASP6 [J].
Karplus, K ;
Katzman, S ;
Shackleford, G ;
Koeva, M ;
Draper, J ;
Barnes, B ;
Soriano, M ;
Hughey, R .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :135-142
[10]   Combining local-structure, fold-recognition, and new fold methods for protein structure prediction [J].
Karplus, K ;
Karchin, R ;
Draper, J ;
Casper, J ;
Mandel-Gutfreund, Y ;
Diekhans, M ;
Hughey, R .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :491-496