Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection

被引:80
作者
Chivian, Dylan
Baker, David
机构
[1] Univ Washington, Dept Biochem, Seattle, WA 98195 USA
[2] Howard Hughes Med Inst, Seattle, WA USA
基金
英国惠康基金;
关键词
D O I
10.1093/nar/gkl480
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The accuracy of a homology model based on the structure of a distant relative or other topologically equivalent protein is primarily limited by the quality of the alignment. Here we describe a systematic approach for sequence-to-structure alignment, called 'K*Sync', in which alignments are generated by dynamic programming using a scoring function that combines information on many protein features, including a novel measure of how obligate a sequence region is to the protein fold. By systematically varying the weights on the different features that contribute to the alignment score, we generate very large ensembles of diverse alignments, each optimal under a particular constellation of weights. We investigate a variety of approaches to select the best models from the ensemble, including consensus of the alignments, a hydrophobic burial measure, low- and high-resolution energy functions, and combinations of these evaluation methods. The effect on model quality and selection resulting from loop modeling and backbone optimization is also studied. The performance of the method on a benchmark set is reported and shows the approach to be effective at both generating and selecting accurate alignments. The method serves as the foundation of the homology modeling module in the Robetta server.
引用
收藏
页数:18
相关论文
共 89 条
  • [1] ALTSCHUL SF, 1986, B MATH BIOL, V48, P603, DOI 10.1016/S0092-8240(86)90010-8
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [4] Bonneau R, 2001, PROTEINS, V43, P1, DOI 10.1002/1097-0134(20010401)43:1<1::AID-PROT1012>3.0.CO
  • [5] 2-A
  • [6] Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation
    Bradley, P
    Chivian, D
    Meiler, J
    Misura, KMS
    Rohl, CA
    Schief, WR
    Wedemeyer, WJ
    Schueler-Furman, O
    Murphy, P
    Schonbrun, J
    Strauss, CEM
    Baker, D
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 : 457 - 468
  • [7] Cyclic coordinate descent: A robotics algorithm for protein loop closure
    Canutescu, AA
    Dunbrack, RL
    [J]. PROTEIN SCIENCE, 2003, 12 (05) : 963 - 972
  • [8] Prediction of CASP6 structures using automated Robetta protocols
    Chivian, D
    Kim, DE
    Malmström, L
    Schonbrun, J
    Rohl, CA
    Baker, D
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 : 157 - 166
  • [9] Automated prediction of CASP-5 structures using the Robetta server
    Chivian, D
    Kim, DE
    Malmström, L
    Bradley, P
    Robertson, T
    Murphy, P
    Strauss, CEM
    Bonneau, R
    Rohl, CA
    Baker, D
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 : 524 - 533
  • [10] In silico protein recombination:: Enhancing template and sequence alignment selection for comparative protein modelling
    Contreras-Moreira, B
    Fitzjohn, PW
    Bates, PA
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2003, 328 (03) : 593 - 608