Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network

被引:155
作者
Chen, Chao [1 ]
Zhou, Xibin [1 ]
Tian, Yuanxin [1 ]
Zou, Xiaoyong [1 ]
Cai, Peixiang [1 ]
机构
[1] Sun Yat Sen Univ, Sch Chem & Chem Engn, Guangzhou 510275, Peoples R China
基金
中国国家自然科学基金;
关键词
support vector machine; fusion; amino acid composition; pair-coupled amino acid composition; pseudo-amino acid composition; protein structural class;
D O I
10.1016/j.ab.2006.07.022
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Because a priori knowledge of a protein structural class can provide useful information about its overall structure, the determination of protein structural class is a quite meaningful topic in protein science. However, with the rapid increase in newly found protein sequences entering into databanks, it is both time-consuming and expensive to do so based solely on experimental techniques. Therefore, it is vitally important to develop a computational method for predicting the protein structural class quickly and accurately. To deal with the challenge, this article presents a dual-layer support vector machine (SVM) fusion network that is featured by using a different pseudo-amino acid composition (PseAA). The PseAA here contains much information that is related to the sequence order of a protein and the distribution of the hydrophobic amino acids along its chain. As a showcase, the rigorous jackknife cross-validation test was performed on the two benchmark data sets constructed by Zhou. A significant enhancement in success rates was observed, indicating that the current approach may serve as a powerful complementary tool to other existing methods in this area. (c) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:116 / 121
页数:6
相关论文
共 51 条
[11]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[12]   Progress in protein structural class prediction and its impact to bioinformatics and proteomics [J].
Chou, KC .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2005, 6 (05) :423-436
[13]   Prediction of membrane protein types by incorporating amphipathic effects [J].
Chou, KC ;
Cai, YD .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (02) :407-413
[14]   A CORRELATION-COEFFICIENT METHOD TO PREDICTING PROTEIN-STRUCTURAL CLASSES FROM AMINO-ACID COMPOSITIONS [J].
CHOU, KC ;
ZHANG, CT .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1992, 207 (02) :429-433
[15]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[16]   Predicting protein structural class by functional domain composition [J].
Chou, KC ;
Cai, YD .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 321 (04) :1007-1009
[17]   Predicting subcellular localization of proteins by hybridizing functional domain composition and pseudo-amino acid composition [J].
Chou, KC ;
Cai, YD .
JOURNAL OF CELLULAR BIOCHEMISTRY, 2004, 91 (06) :1197-1203
[18]   Predicting protein quaternary structure by pseudo amino acid composition [J].
Chou, KC ;
Cai, YD .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 53 (02) :282-289
[19]   Using functional domain composition and support vector machines for prediction of protein subcellular location [J].
Chou, KC ;
Cai, YD .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (48) :45765-45769
[20]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349