A three-state prediction of single point mutations on protein stability changes

Cited by: 258
Authors
Capriotti, Emidio [3]
Fariselli, Piero [1]
Rossi, Ivan [1, 2]
Casadio, Rita [1]
Affiliations
[1] Univ Bologna, CIRB Dept Biol, Lab Biocomp, I-40126 Bologna, Italy
[2] BioDec Srl, Casalecchio Di Reno Bolo, Italy
[3] CIPF, Bioinformat Dept, Struct Genom Unit, Valencia, Spain
Source
BMC BIOINFORMATICS | 2008 / Vol. 9
Keywords
SUPPORT VECTOR MACHINES; SOLVENT ACCESSIBILITY; SECONDARY STRUCTURE; POTENTIALS; SEQUENCE; DISTANCE;
DOI
10.1186/1471-2105-9-S2-S6
Chinese Library Classification
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Background: A basic question in protein structural studies is to what extent mutations affect stability. This question may be addressed starting from sequence and/or from structure. In proteomics and genomics studies, prediction of the change in protein stability free energy (ΔΔG) upon single point mutation may also help the annotation process. The experimental ΔΔG values are affected by uncertainty, as measured by their standard deviations. Most of the ΔΔG values are close to zero (about 32% of the ΔΔG data set lies between -0.5 and 0.5 kcal/mol), and for the same mutation the reported ΔΔG may vary in both magnitude and sign, which blurs the relationship between mutations and the expected ΔΔG value. To overcome this problem we describe a new predictor that discriminates between three mutation classes: destabilizing mutations (ΔΔG < -1.0 kcal/mol), stabilizing mutations (ΔΔG > 1.0 kcal/mol) and neutral mutations (-1.0 <= ΔΔG <= 1.0 kcal/mol).
Results: In this paper a support vector machine, starting from the protein sequence or structure, discriminates between stabilizing, destabilizing and neutral mutations. We rank all the possible substitutions according to a three-state classification system and show that the overall accuracy of our predictor is as high as 56% when starting from sequence information and 61% when the protein structure is available, with mean correlation coefficients of 0.27 and 0.35, respectively. These values are about 20 percentage points higher than those of a random predictor.
Conclusions: Our method improves the quality of the prediction of the free energy change due to single point protein mutations by adopting a hypothesis of thermodynamic reversibility of the existing experimental data. In this way we both recast the thermodynamic symmetry of the problem and balance the distribution of the available experimental measurements of free energy changes. This eliminates possible overestimations by previously described methods, which were trained on an unbalanced data set containing more destabilizing than stabilizing mutations.
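The three-state labelling and the reversibility assumption described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the tuple layout for a mutation, and the example values are hypothetical, and the only inputs taken from the abstract are the ±1.0 kcal/mol thresholds and the idea that the reverse mutation is assigned the opposite ΔΔG.

```python
from collections import Counter
from typing import List, Tuple

# Threshold in kcal/mol, taken from the class definitions in the abstract.
THRESHOLD = 1.0

# Hypothetical record layout: (wild-type residue, position, mutant residue, ddG).
Mutation = Tuple[str, int, str, float]


def classify_ddg(ddg: float, threshold: float = THRESHOLD) -> str:
    """Map a ddG value to one of the three classes named in the abstract."""
    if ddg < -threshold:
        return "destabilizing"
    if ddg > threshold:
        return "stabilizing"
    return "neutral"  # -threshold <= ddg <= threshold


def add_reverse_mutations(data: List[Mutation]) -> List[Mutation]:
    """Assume thermodynamic reversibility: for each A->B with ddG,
    add the reverse mutation B->A with -ddG."""
    augmented = list(data)
    for wild_type, position, mutant, ddg in data:
        augmented.append((mutant, position, wild_type, -ddg))
    return augmented


# Made-up example values, for illustration only.
example = [("A", 10, "G", -2.0), ("L", 25, "V", -1.4), ("K", 7, "R", 0.3)]
augmented = add_reverse_mutations(example)
class_counts = Counter(classify_ddg(m[3]) for m in augmented)
```

After augmentation the stabilizing and destabilizing classes have equal counts by construction, which is the balancing effect the Conclusions refer to. A real pipeline would also need the sequence or structural environment of the reverse mutation, which this sketch leaves out.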
Pages: 9