Prediction of enzyme family classes

被引:106
作者
Chou, KC [1 ]
Elrod, DW [1 ]
机构
[1] Pharmacia, Comp Aided Drug Discovery Bioinformat, Kalamazoo, MI 49007 USA
关键词
bioinformatics; classification of enzyme commission; oxidoreductases; subfamilies; amino acid-composition; covariant-discriminant algorithm;
D O I
10.1021/pr0255710
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Classes of newly found enzyme sequences are usually determined either by biochemical analysis of eukaryotic and prokaryotic genomes or by microarray chips. These experimental methods are both time-consuming and costly. With the explosion of protein sequences entering into databanks, it is highly desirable to explore the feasibility of selectively classifying newly found enzyme sequences into their respective enzyme classes by means of an automated method. This is indeed important because knowing which family or subfamily an enzyme belongs to may help deduce its catalytic mechanism and specificity, giving clues to the relevant biological function. In this study, a bioinformatical analysis was conducted for 2640 oxidoreductases classified into 16 subclasses according to the different types of substrates they act on during the catalytic process. Although it is an extremely complicated problem and might involve the knowledge of 3-dimensional structure as well as many other physical chemistry factors, some quite promising results have been obtained indicating that the family or subfamily of an enzyme is predictable to a considerable degree by means of sequence-based approach alone if a good training dataset can be established.
引用
收藏
页码:183 / 190
页数:8
相关论文
共 18 条
  • [1] [Anonymous], 1992, ENZYME NOMENCLATURE
  • [2] The SWISS-PROT protein sequence data bank and its supplement TrEMBL
    Bairoch, A
    Apweller, R
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (01) : 31 - 36
  • [3] The ENZYME database in 2000
    Bairoch, A
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 304 - 305
  • [4] Is it a paradox or misinterpretation?
    Cai, YD
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03): : 336 - 338
  • [5] Protein subcellular location prediction
    Chou, KC
    Elrod, DW
    [J]. PROTEIN ENGINEERING, 1999, 12 (02): : 107 - 118
  • [6] PREDICTION OF PROTEIN STRUCTURAL CLASSES
    CHOU, KC
    ZHANG, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349
  • [7] A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE
    CHOU, KC
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04): : 319 - 344
  • [8] Chou KC, 1998, PROTEINS, V31, P97, DOI 10.1002/(SICI)1097-0134(19980401)31:1<97::AID-PROT8>3.3.CO
  • [9] 2-Y
  • [10] CHOU KC, 1994, J BIOL CHEM, V269, P22014