Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine

被引:220
作者
Chen, Chao [1 ]
Chen, Lixuan [2 ]
Zou, Xiaoyong [3 ]
Cai, Peixiang [3 ]
机构
[1] Guangdong Pharmaceut Univ, Sch Tradit Chinese Med, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangzhou Inst Standardizat, Guangzhou 510170, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Chem & Chem Engn, Guangzhou 510275, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Pseudo Amino acid composition; support vector machine; protein secondary structure content; prediction; SUBCELLULAR LOCATION PREDICTION; WEB-SERVER; EVOLUTIONARY INFORMATION; ENSEMBLE CLASSIFIER; CIRCULAR-DICHROISM; FUSION CLASSIFIER; MEMBRANE-PROTEINS; ADABOOST-LEARNER; SIGNAL PEPTIDES; MULTIPLE SITES;
D O I
10.2174/092986609787049420
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein secondary structure carries information about local structural arrangements. Significant majority of successful methods for predicting the secondary structure is based on multiple sequence alignment. However, the multiple alignment fails to achieve accurate results when a protein sequence is characterized by low homology. To this end, we propose a novel method for prediction of secondary structure content through comprehensive sequence representation. The method is featured by employing a support vector machine (SVM) regressing system and adopting a different pseudo amino acid composition (PseAAC), which can partially take into account the sequence-order effects to represent protein samples. It was shown by both the self-consistency test and the independent-dataset test that the trained SVM has remarkable power in grasping the relationship between the PseAAC and the content of protein secondary structural elements, including helix, 310-helix, helix, strand, bridge, turn, bend and the rest random coil. Results prior to or competitive with the popular methods have been obtained, which indicate that the present method may at least serve as an alternative to the existing predictors in this area.
引用
收藏
页码:27 / 31
页数:5
相关论文
共 59 条
[1]   Prediction of protein secondary structure content by artificial neural network [J].
Cai, YD ;
Liu, XJ ;
Chou, KC .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2003, 24 (06) :727-731
[2]   Predicting protein structural class based on multi-features fusion [J].
Chen, Chao ;
Chen, Li-Xuan ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang .
JOURNAL OF THEORETICAL BIOLOGY, 2008, 253 (02) :388-392
[3]   Prediction of protein secondary structure content using support vector machine [J].
Chen, Chao ;
Tian, Yuanxin ;
Zou, Xiaoyong ;
Cai, Peixiang ;
Mo, Jinyuan .
TALANTA, 2007, 71 (05) :2069-2073
[4]   Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[5]   Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network [J].
Chen, Chao ;
Zhou, Xibin ;
Tian, Yuanxin ;
Zou, Xiaoyong ;
Cai, Peixiang .
ANALYTICAL BIOCHEMISTRY, 2006, 357 (01) :116-121
[6]   Putative secondary active site of bovine pancreatic deoxyribonuclease I [J].
Chen, Wei-Jung ;
Huang, Po-Tsang ;
Cheng, Yu-Che ;
Liao, Ta-Hsiu .
PROTEIN AND PEPTIDE LETTERS, 2008, 15 (06) :640-646
[7]   Predicting protein structural class by functional domain composition [J].
Chou, KC ;
Cai, YD .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 321 (04) :1007-1009
[8]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[9]   Prediction of protein cellular attributes using pseudo-amino acid composition [J].
Chou, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) :246-255
[10]   Using pair-coupled amino acid composition to predict protein secondary structure content [J].
Chou, KC .
JOURNAL OF PROTEIN CHEMISTRY, 1999, 18 (04) :473-480