Using a neural network and spatial clustering to predict the location of active sites in enzymes

被引:153
作者
Gutteridge, A [1 ]
Bartlett, GJ [1 ]
Thornton, JM [1 ]
机构
[1] EBI, EMBL Outstn, Hinxton CB10 1SD, Cambs, England
基金
英国医学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
bioinformatics; structural genomics; functional prediction; neural network; active sites;
D O I
10.1016/S0022-2836(03)00515-1
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural genomics projects aim to provide a sharp increase in the number of structures of functionally unannotated, and largely unstudied, proteins. Algorithms and tools capable of deriving information about the nature, and location, of functional sites within a structure are increasingly useful therefore. Here, a neural network is trained to identify the catalytic residues found in enzymes, based on an analysis of the structure and sequence. The neural network output, and spatial clustering of the highly scoring residues are then used to predict the location of the active site. A comparison of the performance of differently trained neural networks is presented that shows how information from sequence and structure come together to improve the prediction accuracy of the network. Spatial clustering of the network results provides a reliable way of finding likely active sites. In over 69% of the test cases the active site is correctly predicted, and a further 25% are partially correctly predicted. The failures are generally due to the poor quality of the automatically generated sequence alignments. We also present predictions identifying the active site, and potential functional residues in five recently solved enzyme structures, not used in developing the method. The method correctly identifies the putative active site in each case. In most cases the likely functional residues are identified correctly, as well as some potentially novel functional groups. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:719 / 734
页数:16
相关论文
共 59 条
  • [1] Aktories K, 1997, ADV EXP MED BIOL, V419, P53
  • [2] Automated structure-based prediction of functional sites in proteins: Applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking
    Aloy, P
    Querol, E
    Aviles, FX
    Sternberg, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 311 (02) : 395 - 408
  • [3] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [4] ConSurf: An algorithmic tool for the identification of functional regions in proteins by surface mapping of phylogenetic information
    Armon, A
    Graur, D
    Ben-Tal, N
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (01) : 447 - 463
  • [5] THE ENZYME DATA-BANK
    BAIROCH, A
    [J]. NUCLEIC ACIDS RESEARCH, 1993, 21 (13) : 3155 - 3156
  • [6] Analysis of catalytic residues in enzyme active sites
    Bartlett, GJ
    Porter, CT
    Borkakoti, N
    Thornton, JM
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2002, 324 (01) : 105 - 121
  • [7] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [8] BERGERBACHI B, 1989, MOL GEN GENET, V219, P263
  • [9] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [10] Using GeneWise in the Drosophila annotation experiment
    Birney, E
    Durbin, R
    [J]. GENOME RESEARCH, 2000, 10 (04) : 547 - 548