SUMOhydro: A Novel Method for the Prediction of Sumoylation Sites Based on Hydrophobic Properties

Cited by: 58
Authors
Chen, Yong-Zi [1, 2]
Chen, Zhen [3]
Gong, Yu-Ai [1, 2]
Ying, Guoguang [1, 2]
Affiliations
[1] Tianjin Med Univ, Canc Inst & Hosp Tianjin, Canc Cell Biol Lab, Tianjin, Peoples R China
[2] Tianjin Municipal Sci & Technol Commiss, Tianjin Key Lab Canc Prevent & Therapy, Tianjin, Peoples R China
[3] China Agr Univ, Coll Biol Sci, Bioinformat Ctr, Beijing 100094, Peoples R China
Keywords
AMINO-ACID PAIRS; RNA-BINDING SITES; PROTEIN SEQUENCES; WEB SERVER; PSI-BLAST; SUMO; INFORMATION; COLLOCATION; PLANTS; MOTIF;
DOI
10.1371/journal.pone.0039195
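Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
070301 [Inorganic chemistry]; 070403 [Astrophysics]; 070507 [Natural resources and territorial spatial planning]; 090105 [Crop production systems and ecological engineering];
Abstract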
Sumoylation is one of the most essential mechanisms of reversible protein post-translational modifications and is a crucial biochemical process in the regulation of a variety of important biological functions. Sumoylation is also closely involved in various human diseases. The accurate computational identification of sumoylation sites in protein sequences aids in experimental design and mechanistic research in cellular biology. In this study, we introduced amino acid hydrophobicity as a parameter into a traditional binary encoding scheme and developed a novel sumoylation site prediction tool termed SUMOhydro. With the assistance of a support vector machine, the proposed method was trained and tested using a stringent non-redundant sumoylation dataset. In a leave-one-out cross-validation, the proposed method yielded an excellent performance with a correlation coefficient, specificity, sensitivity and accuracy equal to 0.690, 98.6%, 71.1% and 97.5%, respectively. In addition, SUMOhydro has been benchmarked against previously described predictors based on an independent dataset, thereby suggesting that the introduction of hydrophobicity as an additional parameter could assist in the prediction of sumoylation sites. Currently, SUMOhydro is freely accessible at http://protein.cau.edu.cn/others/SUMOhydro/.
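The encoding and evaluation steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the Kyte-Doolittle hydropathy scale, the 21-values-per-residue layout, the function names `encode_window` and `evaluate`, and the reading of "correlation coefficient" as the Matthews correlation coefficient are all assumptions made for illustration. The abstract does not state which hydrophobicity scale or window size SUMOhydro uses.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy values (assumption: the paper may use another scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode_window(window):
    """Encode each residue as a 20-bit binary vector plus one hydrophobicity value.

    Unknown or padding characters become an all-zero vector with hydrophobicity 0.0.
    """
    features = []
    for residue in window:
        one_hot = [1.0 if residue == aa else 0.0 for aa in AMINO_ACIDS]
        hydrophobicity = KYTE_DOOLITTLE.get(residue, 0.0)
        features.extend(one_hot + [hydrophobicity])
    return features


def evaluate(tp, tn, fp, fn):
    """Compute sensitivity, specificity, accuracy and MCC from confusion counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is undefined when any marginal count is zero; report 0.0 in that case.
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }
```

In a setup like the one the abstract describes, vectors from `encode_window` would be passed to a support vector machine, and `evaluate` would be applied to the confusion counts pooled over the leave-one-out folds.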
Pages: 8