Prediction of Protein Secondary Structure Content by Using the Concept of Chou's Pseudo Amino Acid Composition and Support Vector Machine

Cited by: 220
Authors
Chen, Chao [1]
Chen, Lixuan [2]
Zou, Xiaoyong [3]
Cai, Peixiang [3]
Affiliations
[1] Guangdong Pharmaceut Univ, Sch Tradit Chinese Med, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangzhou Inst Standardizat, Guangzhou 510170, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Chem & Chem Engn, Guangzhou 510275, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Pseudo amino acid composition; support vector machine; protein secondary structure content; prediction; SUBCELLULAR LOCATION PREDICTION; WEB-SERVER; EVOLUTIONARY INFORMATION; ENSEMBLE CLASSIFIER; CIRCULAR-DICHROISM; FUSION CLASSIFIER; MEMBRANE-PROTEINS; ADABOOST-LEARNER; SIGNAL PEPTIDES; MULTIPLE SITES
DOI
10.2174/092986609787049420
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
Protein secondary structure carries information about local structural arrangements. The large majority of successful methods for predicting secondary structure are based on multiple sequence alignment. However, multiple alignment fails to achieve accurate results when a protein sequence has low homology. To address this, we propose a novel method for predicting secondary structure content through a comprehensive sequence representation. The method employs a support vector machine (SVM) regression system and adopts a different form of pseudo amino acid composition (PseAAC), which partially accounts for sequence-order effects, to represent protein samples. Both the self-consistency test and the independent-dataset test show that the trained SVM is remarkably effective at capturing the relationship between the PseAAC and the content of protein secondary structural elements, including α-helix, 3₁₀-helix, π-helix, strand, bridge, turn, bend, and the remaining random coil. The results are superior to or competitive with those of popular methods, which indicates that the present method may at least serve as an alternative to existing predictors in this area.
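To make the sequence representation concrete, the following is a minimal sketch of the classic PseAAC construction (20 composition terms plus a few sequence-order correlation terms). It is not the specific variant used in the paper, which the abstract describes only as "a different" PseAAC. The function name `pseaac`, the parameters `lam` and `w`, and the use of a single Kyte-Doolittle hydropathy scale are assumptions made for illustration.

```python
from statistics import mean, pstdev

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy values, used here as a stand-in property
# (assumption: the paper may use other physicochemical properties).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def _standardize(scale):
    """Rescale a property to zero mean and unit variance over the 20 residues."""
    mu = mean(scale.values())
    sd = pstdev(scale.values())
    return {aa: (v - mu) / sd for aa, v in scale.items()}


def pseaac(seq, lam=3, w=0.05):
    """Return a (20 + lam)-dimensional pseudo amino acid composition vector.

    lam: number of sequence-order correlation tiers (hypothetical default).
    w:   weight of the correlation terms relative to composition.
    """
    seq = seq.upper()
    if any(aa not in KD for aa in seq):
        raise ValueError("sequence contains a non-standard residue")
    n = len(seq)
    if n <= lam:
        raise ValueError("sequence must be longer than lam")

    h = _standardize(KD)

    # Conventional amino acid composition: 20 occurrence frequencies.
    freqs = [seq.count(aa) / n for aa in AMINO_ACIDS]

    # Tier-k correlation factor: mean squared property difference
    # between residues that are k positions apart.
    thetas = [
        mean((h[seq[i]] - h[seq[i + k]]) ** 2 for i in range(n - k))
        for k in range(1, lam + 1)
    ]

    denom = sum(freqs) + w * sum(thetas)
    return [f / denom for f in freqs] + [w * t / denom for t in thetas]


vec = pseaac("MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ")
```

A vector like `vec` would then serve as the input to a regression model (an SVM regressor in the paper), with one model per secondary structure element predicting its fractional content.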
Pages: 27-31
Page count: 5