A novel approach to predict active sites of enzyme molecules

被引:38
作者
Chou, KC
Cai, YD
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] Tianjin Inst Bioinformat & Drug Discovery, Tianjin, Peoples R China
[3] UMIST, Biomol Sci Dept, Manchester M60 1QD, Lancs, England
关键词
catalytic triad; serine hydrolase; topological entity; covariance matrix; Mahalanobis distance; discriminant function; structural bioinformatics;
D O I
10.1002/prot.10622
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Enzymes are critical in many cellular signaling cascades. With many enzyme structures being solved, there is an increasing need to develop an automated method for identifying their active sites. However, given the atomic coordinates of an enzyme molecule, how can we predict its active site? This is a vitally important problem because the core of an enzyme molecule is its active site from the viewpoints of both pure scientific research and industrial application. In this article, a topological entity was introduced to characterize the enzymatic active site. Based on such a concept, the covariant discriminant algorithm was formulated for identifying the active site. As a paradigm, the serine hydrolase family was demonstrated. The overall success rate by jackknife test for a data set of 88 enzyme molecules was 99.92%, and that for a data set of 50 independent enzyme molecules was 99.91%. Meanwhile, it was shown through an example that the prediction algorithm can also be used to find any typographic error of a PDB file in annotating the constituent amino acids of catalytic triad and to suggest a possible correction. The very high success rates are due to the introduction of a covariance matrix in the prediction algorithm that makes allowance for taking into account the coupling effects among the key constituent atoms of active site. It is anticipated that the novel approach is quite promising and may become a useful high throughput tool in enzymology, proteomics, and structural bioinformatics. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:77 / 82
页数:6
相关论文
共 26 条
[1]   A GRAPH-THEORETIC APPROACH TO THE IDENTIFICATION OF 3-DIMENSIONAL PATTERNS OF AMINO-ACID SIDE-CHAINS IN PROTEIN STRUCTURES [J].
ARTYMIUK, PJ ;
POIRRETTE, AR ;
GRINDLEY, HM ;
RICE, DW ;
WILLETT, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (02) :327-344
[2]   The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[3]   Support vector machines for prediction of protein subcellular location by incorporating quasi-sequence-order effect [J].
Cai, YD ;
Liu, XJ ;
Xu, XB ;
Chou, KC .
JOURNAL OF CELLULAR BIOCHEMISTRY, 2002, 84 (02) :343-348
[4]  
Cai Yu-Dong, 2000, Molecular Cell Biology Research Communications, V4, P172, DOI 10.1006/mcbr.2001.0269
[5]   Solution structure of BID, an intracellular amplifier of apoptotic signaling [J].
Chou, JJ ;
Li, HL ;
Salvesen, GS ;
Yuan, JY ;
Wagner, G .
CELL, 1999, 96 (05) :615-624
[6]   Solution structure of Ca2+-calmodulin reveals flexible hand-like properties of its domains [J].
Chou, JJ ;
Li, SP ;
Klee, CB ;
Bax, A .
NATURE STRUCTURAL BIOLOGY, 2001, 8 (11) :990-997
[7]   Solution structure of the RAIDD CARD and model for CARD/CARD interaction in caspase-2 and caspase-9 recruitment [J].
Chou, JJ ;
Matsuo, H ;
Duan, H ;
Wagner, G .
CELL, 1998, 94 (02) :171-180
[9]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[10]   A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE [J].
CHOU, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04) :319-344