Using pseudo-amino acid composition and support vector machine to predict protein structural class

被引:173
作者
Chen, Chao [1 ]
Tian, Yuan-Xin [1 ]
Zou, Xiao-Yong [1 ]
Cai, Pei-Xiang [1 ]
Mo, Jin-Yuan [1 ]
机构
[1] Sun Yat Sen Univ, Sch Chem & Chem Engn, Guangzhou 510275, Peoples R China
关键词
support vector machine; pseudo-amino acid composition; protein structural class; prediction;
D O I
10.1016/j.jtbi.2006.06.025
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
As a result of genome and other sequencing projects, the gap between the number of known protein sequences and the number of known protein structural classes is widening rapidly. In order to narrow this gap, it is vitally important to develop a computational prediction method for fast and accurately determining the protein structural class. In this paper, a novel predictor is developed for predicting protein structural class. It is featured by employing a support vector machine learning system and using a different pseudoamino acid composition (PseAA), which was introduced to, to some extent, take into account the sequence-order effects to represent protein samples. As a demonstration, the jackknife cross-validation test was performed on a working dataset that contains 204 nonhomologous proteins. The predicted results are very encouraging, indicating that the current predictor featured with the PseAA may play an important complementary role to the elegant covariant discriminant predictor and other existing algorithms. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:444 / 448
页数:5
相关论文
共 47 条
  • [1] Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.3.CO
  • [2] 2-D
  • [3] Assessing the accuracy of prediction algorithms for classification: an overview
    Baldi, P
    Brunak, S
    Chauvin, Y
    Andersen, CAF
    Nielsen, H
    [J]. BIOINFORMATICS, 2000, 16 (05) : 412 - 424
  • [4] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [5] Prediction of protein structural classes by neural network
    Cai, YD
    Zhou, GP
    [J]. BIOCHIMIE, 2000, 82 (08) : 783 - 785
  • [6] Using LogitBoost classifier to predict protein structural classes
    Cai, YD
    Feng, KY
    Lu, WC
    Chou, KC
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 238 (01) : 172 - 176
  • [7] Support vector machines for predicting membrane protein types by using functional domain composition
    Cai, YD
    Zhou, GP
    Chou, KC
    [J]. BIOPHYSICAL JOURNAL, 2003, 84 (05) : 3257 - 3263
  • [8] Prediction of protein structural classes by support vector machines
    Cai, YD
    Liu, XJ
    Xu, XB
    Chou, KC
    [J]. COMPUTERS & CHEMISTRY, 2002, 26 (03): : 293 - 296
  • [9] Support Vector Machines for predicting protein structural class
    Cai, Yu-Dong
    Liu, Xiao-Jun
    Xu, Xue-biao
    Zhou, Guo-Ping
    [J]. BMC BIOINFORMATICS, 2001, 2 (1)
  • [10] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)