Toward optimal fragment generations for ab initio protein structure assembly

被引:170
作者
Xu, Dong [1 ]
Zhang, Yang [1 ,2 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
ab initio folding; contact prediction; secondary structure prediction; structural fragments; STRUCTURE PREDICTION; SECONDARY STRUCTURE; ALGORITHM; FOLD; RECOGNITION; INFORMATION; SEQUENCES; ALIGNMENT; MODELS;
D O I
10.1002/prot.24179
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Fragment assembly using structural motifs excised from other solved proteins has shown to be an efficient method for ab initio protein-structure prediction. However, how to construct accurate fragments, how to derive optimal restraints from fragments, and what the best fragment length is are the basic issues yet to be systematically examined. In this work, we developed a gapless-threading method to generate position-specific structure fragments. Distance profiles and torsion angle pairs are then derived from the fragments by statistical consistency analysis, which achieved comparable accuracy with the machine-learning-based methods although the fragments were taken from unrelated proteins. When measured by both accuracies of the derived distance profiles and torsion angle pairs, we come to a consistent conclusion that the optimal fragment length for structural assembly is around 10, and at least 100 fragments at each location are needed to achieve optimal structure assembly. The distant profiles and torsion angle pairs as derived by the fragments have been successfully used in QUARK for ab initio protein structure assembly and are provided by the QUARK online server at http://zhanglab.ccmb. med.umich.edu/QUARK/. Proteins 2013. (C) 2012 Wiley Periodicals, Inc.
引用
收藏
页码:229 / 239
页数:11
相关论文
共 50 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Reconstruction of protein backbones from the BriX collection of canonical protein fragments [J].
Baeten, Lies ;
Reumers, Joke ;
Tur, Vicente ;
Stricher, Francois ;
Lenaerts, Tom ;
Serrano, Luis ;
Rousseau, Frederic ;
Schymkowitz, Joost .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (05)
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[5]   AN EVOLUTIONARY APPROACH TO FOLDING SMALL ALPHA-HELICAL PROTEINS THAT USES SEQUENCE INFORMATION AND AN EMPIRICAL GUIDING FITNESS FUNCTION [J].
BOWIE, JU ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (10) :4436-4440
[6]   CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].
BROOKS, BR ;
BRUCCOLERI, RE ;
OLAFSON, BD ;
STATES, DJ ;
SWAMINATHAN, S ;
KARPLUS, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) :187-217
[7]  
CASE DA, 1997, AMBER 5 0
[8]   Improved residue contact prediction using support vector machines and a large feature set [J].
Cheng, Jianlin ;
Baldi, Pierre .
BMC BIOINFORMATICS, 2007, 8 (1)
[9]   Assessment of domain boundary predictions and the prediction of intramolecular contacts in CASP8 [J].
Ezkurdia, Iakes ;
Grana, Osvaldo ;
Izarzugaza, Jose M. G. ;
Tress, Michael L. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :196-209
[10]   The dual role of fragments in fragment-assembly methods for de novo protein structure prediction [J].
Handl, Julia ;
Knowles, Joshua ;
Vernon, Robert ;
Baker, David ;
Lovell, Simon C. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2012, 80 (02) :490-504