ECS: An automatic enzyme classifier based on functional domain composition

被引:42
作者
Lu, Lingyi
Qian, Ziliang
Cai, Yu-Dong
Li, Yixue
机构
[1] Univ Manchester, Inst Sci & Technol, Dept Math, Manchester M60 1QD, Lancs, England
[2] Chinese Acad Sci, Grad Sch, Beijing 100039, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, CAS MPG Partner Inst Computat Biol, Beijing 100039, Peoples R China
[4] Acad Sinica, Shanghai Inst Biol Sci, Bioinformat Ctr, Key Lab Mol Syst Biol, Shanghai 200031, Peoples R China
[5] Shanghai Ctr Bioinformat Technol, Shanghai 200235, Peoples R China
关键词
enzyme; classification; functional domain composition; support vector machine;
D O I
10.1016/j.compbiolchem.2007.03.008
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Classification for enzymes is a prerequisite for understanding their function. Here, an automatic enzyme identifier based on support vector machine (SVM) with feature vectors from protein functional domain composition was built to identify enzymes and further a classifier to classify enzymes into six different classes: oxidoreductase, transferase, hydrolase, lyase, isomerase and ligase. Jackknife cross-validation test was adopted to evaluate the performance of our classifier. The 86.03% success rate achieved for enzyme/non-enzyme identification and 91.32% for enzyme classification, which is much better than that of the BLAST and PSI-BLAST based method, also outperforms several existed works. The results indicate that protein functional domain composition is able to capture the major features which facilitate the identification/classification of proteins, thus demonstrating that our predictor could be a more effective and promising high-throughput method in enzyme research. Moreover, a web-based software Enzyme Classification System (ECS) for identification as well as classification of enzymes can be accessed at: http://pcal.biosino.org/. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:226 / 232
页数:7
相关论文
共 19 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] Bateman A, 2002, NUCLEIC ACIDS RES, V30, P276, DOI [10.1093/nar/gkr1065, 10.1093/nar/gkp985, 10.1093/nar/gkh121]
  • [4] Enzyme family classification by support vector machines
    Cai, CZ
    Han, LY
    Ji, ZL
    Chen, YZ
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (01) : 66 - 76
  • [5] Using functional domain composition to predict enzyme family classes
    Cai, YD
    Chou, KC
    [J]. JOURNAL OF PROTEOME RESEARCH, 2005, 4 (01) : 109 - 111
  • [6] Application of SVM to predict membrane protein types
    Cai, YD
    Ricardo, PW
    Jen, CH
    Chou, KC
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2004, 226 (04) : 373 - 376
  • [7] Support vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence
    Cai, YD
    Lin, SL
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2003, 1648 (1-2): : 127 - 133
  • [8] CAI YD, 2005, J PROTEOME RES, V4
  • [9] Prediction of novel archaeal enzymes from sequence-derived features
    Jensen, LJ
    Skovgaard, M
    Brunak, S
    [J]. PROTEIN SCIENCE, 2002, 11 (12) : 2894 - 2898
  • [10] KC C, 2003, J PROTEOME RES, V2, P183