Threading without optimizing weighting factors for scoring function

被引:9
作者
Yang, Yifeng David [1 ]
Park, Changsoon [2 ]
Kihara, Daisuke [1 ,3 ,4 ]
机构
[1] Purdue Univ, Coll Sci, Dept Biol Sci, W Lafayette, IN 47907 USA
[2] Chung Ang Univ, Coll Nat Sci, Dept Stat, Seoul 156756, South Korea
[3] Purdue Univ, Coll Sci, Dept Comp Sci, W Lafayette, IN 47907 USA
[4] Purdue Univ, Coll Sci, Markey Ctr Struct Biol, W Lafayette, IN 47907 USA
关键词
threading; protein structrure prediction; scoring function; weight optimization; protein folding;
D O I
10.1002/prot.22082
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Optimizing weighting factors for a linear combination of terms in a scoring function is a crucial step for success in developing a threading algorithm. Usually weighting factors are optimized to yield the highest success rate on a training dataset, and the determined constant values for the weighting factors are used for any target sequence. Here we explore completely different approaches to handle weighting factors for a scoring function of threading. Throughout this study we use a model system of gapless threading using a scoring function with two terms combined by a weighting factor, a main chain angle potential and a residue contact potential. First, we demonstrate that the optimal weighting factor for recognizing the native structure differs from target sequence to target sequence. Then, we present three novel threading methods which circumvent training dataset-based weighting factor optimization. The basic idea of the three methods is to employ different weighting factor values and finally select a template structure for a target sequence by examining characteristics of the distribution of scores computed by using the different weighting factor values. Interestingly, the success rate of our approaches is comparable to the conventional threading method where the weighting factor is optimized based on a training dataset. Moreover, when the size of the training set available for the conventional threading method is small, our approach often performs better. In addition, we predict a target-specific weighting factor optimal for a target sequence by an artificial neural network from features of the target sequence. Finally, we show that our novel methods can be used to assess the confidence of prediction of a conventional threading with an optimized constant weighting factor by considering consensus prediction between them. Implication to the underlined energy landscape of protein folding is discussed.
引用
收藏
页码:581 / 596
页数:16
相关论文
共 75 条
[1]   Combining prediction of secondary structure and solvent accessibility in proteins [J].
Adamczak, R ;
Porollo, A ;
Meller, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 59 (03) :467-475
[2]  
Akutsu T, 1998, Pac Symp Biocomput, P413
[3]   POEM: Parameter optimization using ensemble methods: Application to target specific scoring functions [J].
Antes, I ;
Merkwirth, C ;
Lengauer, T .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (05) :1291-1302
[4]   QMEAN: A comprehensive scoring function for model quality assessment [J].
Benkert, Pascal ;
Tosatto, Silvio C. E. ;
Schomburg, Dietmar .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) :261-277
[5]  
Betancourt MR, 1999, PROTEIN SCI, V8, P361
[6]   Local propensities and statistical potentials of backbone dihedral angles in proteins [J].
Betancourt, MR ;
Skolnick, J .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 342 (02) :635-649
[7]   A reduced protein model with accurate native-structure identification ability [J].
Betancourt, MR .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 (04) :889-907
[8]  
Bharath R., 1994, NEURAL NETWORK COMPU
[9]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[10]   Optimizing energy potentials for success in protein tertiary structure prediction [J].
Chiu, TL ;
Goldstein, RA .
FOLDING & DESIGN, 1998, 3 (03) :223-228