Threading without optimizing weighting factors for scoring function

被引:9
作者
Yang, Yifeng David [1 ]
Park, Changsoon [2 ]
Kihara, Daisuke [1 ,3 ,4 ]
机构
[1] Purdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
[2] Chung Ang Univ, Coll Nat Sci, Dept Stat, Seoul 156756, South Korea
[3] Purdue Univ, Coll Sci, Dept Comp Sci, W Lafayette, IN 47907 USA
[4] Purdue Univ, Coll Sci, Markey Ctr Struct Biol, W Lafayette, IN 47907 USA
关键词
threading; protein structrure prediction; scoring function; weight optimization; protein folding;
D O I
10.1002/prot.22082
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Optimizing weighting factors for a linear combination of terms in a scoring function is a crucial step for success in developing a threading algorithm. Usually weighting factors are optimized to yield the highest success rate on a training dataset, and the determined constant values for the weighting factors are used for any target sequence. Here we explore completely different approaches to handle weighting factors for a scoring function of threading. Throughout this study we use a model system of gapless threading using a scoring function with two terms combined by a weighting factor, a main chain angle potential and a residue contact potential. First, we demonstrate that the optimal weighting factor for recognizing the native structure differs from target sequence to target sequence. Then, we present three novel threading methods which circumvent training dataset-based weighting factor optimization. The basic idea of the three methods is to employ different weighting factor values and finally select a template structure for a target sequence by examining characteristics of the distribution of scores computed by using the different weighting factor values. Interestingly, the success rate of our approaches is comparable to the conventional threading method where the weighting factor is optimized based on a training dataset. Moreover, when the size of the training set available for the conventional threading method is small, our approach often performs better. In addition, we predict a target-specific weighting factor optimal for a target sequence by an artificial neural network from features of the target sequence. Finally, we show that our novel methods can be used to assess the confidence of prediction of a conventional threading with an optimized constant weighting factor by considering consensus prediction between them. Implication to the underlined energy landscape of protein folding is discussed.
引用
收藏
页码:581 / 596
页数:16
相关论文
共 75 条
[31]   The PDB is a covering set of small protein structures [J].
Kihara, D ;
Skolnick, J .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 334 (04) :793-802
[32]   TOUCHSTONE:: An ab initio protein structure prediction method that uses threading-based tertiary restraints [J].
Kihara, D ;
Lu, H ;
Kolinski, A ;
Skolnick, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (18) :10125-10130
[33]   PREDICTION OF PROTEIN STRUCTURAL CLASS FROM THE AMINO-ACID-SEQUENCE [J].
KLEIN, P ;
DELISI, C .
BIOPOLYMERS, 1986, 25 (09) :1659-1672
[34]   Towards the high-resolution protein structure prediction. Fast refinement of reduced models with all-atom force field [J].
Kmiecik, Sebastian ;
Gront, Dominik ;
Kolinski, Andrzej .
BMC STRUCTURAL BIOLOGY, 2007, 7
[35]   Progress over the first decade of CASP experiments [J].
Kryshtafovych, A ;
Venclovas, C ;
Fidelis, K ;
Moult, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :225-236
[36]  
Lo Conte L, 2002, NUCLEIC ACIDS RES, V30, P264
[37]   A distance-dependent atomic knowledge-based potential for improved protein structure selection [J].
Lu, H ;
Skolnick, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, 44 (03) :223-232
[38]   CONTACT POTENTIAL THAT RECOGNIZES THE CORRECT FOLDING OF GLOBULAR-PROTEINS [J].
MAIOROV, VN ;
CRIPPEN, GM .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 227 (03) :876-888
[39]   PROTEIN STRUCTURAL SIMILARITIES PREDICTED BY A SEQUENCE STRUCTURE COMPATIBILITY METHOD [J].
MATSUO, Y ;
NISHIKAWA, K .
PROTEIN SCIENCE, 1994, 3 (11) :2055-2063
[40]   ROSETTALIGAND: Protein-small molecule docking with full side-chain flexibility [J].
Meiler, Jens ;
Baker, David .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 65 (03) :538-548