Prediction of Thermostability from Amino Acid Attributes by Combination of Clustering with Attribute Weighting: A New Vista in Engineering Enzymes

被引:49
作者
Ebrahimi, Mansour [1 ,2 ]
Lakizadeh, Amir [2 ,3 ]
Agha-Golzadeh, Parisa [4 ]
Ebrahimie, Esmaeil [4 ]
Ebrahimi, Mahdi [5 ]
机构
[1] Univ Qom, Dept Biol, Qom, Iran
[2] Univ Qom, Bioinformat Res Grp, Qom, Iran
[3] Univ Qom, Dept Comp Sci, Qom, Iran
[4] Shiraz Univ, Coll Agr, Dept Crop Prod & Plant Breeding, Shiraz, Iran
[5] Max Planck Inst Informat, Saarbrucken, Germany
来源
PLOS ONE | 2011年 / 6卷 / 08期
关键词
THERMAL-STABILITY; 3-ISOPROPYLMALATE DEHYDROGENASE; PROTEIN STABILITY; STRUCTURAL BASIS; EXPRESSION; XYLANASE; FEATURES; PATTERNS; DISCRIMINATION; STABILIZATION;
D O I
10.1371/journal.pone.0023146
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The engineering of thermostable enzymes is receiving increased attention. The paper, detergent, and biofuel industries, in particular, seek to use environmentally friendly enzymes instead of toxic chlorine chemicals. Enzymes typically function at temperatures below 60 degrees C and denature if exposed to higher temperatures. In contrast, a small portion of enzymes can withstand higher temperatures as a result of various structural adaptations. Understanding the protein attributes that are involved in this adaptation is the first step toward engineering thermostable enzymes. We employed various supervised and unsupervised machine learning algorithms as well as attribute weighting approaches to find amino acid composition attributes that contribute to enzyme thermostability. Specifically, we compared two groups of enzymes: mesostable and thermostable enzymes. Furthermore, a combination of attribute weighting with supervised and unsupervised clustering algorithms was used for prediction and modelling of protein thermostability from amino acid composition properties. Mining a large number of protein sequences (2090) through a variety of machine learning algorithms, which were based on the analysis of more than 800 amino acid attributes, increased the accuracy of this study. Moreover, these models were successful in predicting thermostability from the primary structure of proteins. The results showed that expectation maximization clustering in combination with uncertainly and correlation attribute weighting algorithms can effectively (100%) classify thermostable and mesostable proteins. Seventy per cent of the weighting methods selected Gln content and frequency of hydrophilic residues as the most important protein attributes. On the dipeptide level, the frequency of Asn-Glu was the key factor in distinguishing mesostable from thermostable enzymes. This study demonstrates the feasibility of predicting thermostability irrespective of sequence similarity and will serve as a basis for engineering thermostable enzymes in the laboratory.
引用
收藏
页数:11
相关论文
共 54 条
[1]   ProSOM:: core promoter prediction based on unsupervised clustering of DNA physical profiles [J].
Abeel, Thomas ;
Saeys, Yvan ;
Rouze, Pierre ;
Van de Peer, Yves .
BIOINFORMATICS, 2008, 24 (13) :I24-I31
[2]   Finding and using hyperthermophilic enzymes [J].
Adams, MWW ;
Kelly, RM .
TRENDS IN BIOTECHNOLOGY, 1998, 16 (08) :329-332
[3]   Crystal structure of the beta-glycosidase from the hyperthermophilic archeon Sulfolobus solfataricus: Resilience as a key factor in thermostability [J].
Aguilar, CF ;
Sanderson, I ;
Moracci, M ;
Ciaramella, M ;
Nucci, R ;
Rossi, M ;
Pearl, LH .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 271 (05) :789-802
[4]   Effect of polar side chains at position 172 on thermal stability of 3-isopropylmalate dehydrogenase from Thermus thermophilus [J].
Akanuma, S ;
Qu, CX ;
Yamagishi, A ;
Tanaka, N ;
Oshima, T .
FEBS LETTERS, 1997, 410 (2-3) :141-144
[5]   Biochemical and structural characterization of a short-chain dehydrogenase/reductase of Thermus thermophilus HB8 A hyperthermostable aldose-1-dehydrogenase with broad substrate specificity [J].
Asada, Yukuhiko ;
Endo, Satoshi ;
Inoue, Yukari ;
Mamiya, Hiroaki ;
Hara, Akira ;
Kunishima, Naoki ;
Matsunaga, Toshiyuki .
CHEMICO-BIOLOGICAL INTERACTIONS, 2009, 178 (1-3) :117-126
[6]   Amino Acid Features of P1B-ATPase Heavy Metal Transporters Enabling Small Numbers of Organisms to Cope with Heavy Metal Pollution [J].
Ashrafi, E. ;
Alemzadeh, A. ;
Ebrahimi, M. ;
Ebrahimie, E. ;
Dadkhodaei, N. .
BIOINFORMATICS AND BIOLOGY INSIGHTS, 2011, 5 :59-82
[7]  
ASHRAFI E, 2011, BIOINFORMAT IN PRESS
[8]  
BALASUBRAMANIAN D, 2007, C P IEEE ENG MED BIO, P2134
[9]   Comprehensive analysis of the factors contributing to the stability and solubility of autonomous human VH domains [J].
Barthelemy, Pierre A. ;
Raab, Helga ;
Appleton, Brent A. ;
Bond, Christopher J. ;
Wu, Ping ;
Wiesmann, Christian ;
Sidhu, Sachdev S. .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2008, 283 (06) :3639-3654
[10]   A new data mining approach for profiling and categorizing kinetic patterns of metabolic biomarkers after myocardial injury [J].
Baumgartner, Christian ;
Lewis, Gregory D. ;
Netzer, Michael ;
Pfeifer, Bernhard ;
Gerszten, Robert E. .
BIOINFORMATICS, 2010, 26 (14) :1745-1751