Identification of homology in protein structure classification

被引:100
作者
Dietmann, S [1 ]
Holm, L [1 ]
机构
[1] EBI, EMBL, Struct Genom Grp, Cambridge CB10 1SD, England
关键词
D O I
10.1038/nsb1101-953
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural biology and structural genomics are expected to produce many three-dimensional protein structures in the near future. Each new structure raises questions about its function and evolution. Correct functional and evolutionary classification of a new structure is difficult for distantly related proteins and error-prone using simple statistical scores based on sequence or structure similarity. Here we present an accurate numerical method for the identification of evolutionary relationships (homology). The method is based on the principle that natural selection maintains structural and functional continuity within a diverging protein family. The problem of different rates of structural divergence between different families is solved by first using structural similarities to produce a global map of folds in protein space and then further subdividing fold neighborhoods into superfamilies based on functional similarities. In a validation test against a classification by human experts (SCOP), 77% of homologous pairs were identified with 92% reliability. The method is fully automated, allowing fast, self-consistent and complete classification of large numbers of protein structures. In particular, the discrimination between analogy and homology of close structural neighbors will lead to functional predictions while avoiding overprediction.
引用
收藏
页码:953 / 957
页数:5
相关论文
共 23 条
  • [1] Baldi P., 1998, Bioinformatics: The machine learning approach
  • [2] Bishop C. M., 1995, NEURAL NETWORKS PATT
  • [3] Structure of the globular region of the prion protein Ure2 from the yeast Saccharomyces cerevisiae
    Bousset, L
    Belrhali, H
    Janin, J
    Melki, R
    Morera, S
    [J]. STRUCTURE, 2001, 9 (01) : 39 - 46
  • [4] Crystal structures of a novel ferric reductase from the hyperthermophilic archaeon Archaeoglobus fulgidus and its complex with NADP+
    Chiu, HJ
    Johnson, E
    Schröder, I
    Rees, DC
    [J]. STRUCTURE, 2001, 9 (04) : 311 - 319
  • [5] Christendat D, 2000, NAT STRUCT BIOL, V7, P903
  • [6] A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3
    Dietmann, S
    Park, J
    Notredame, C
    Heger, A
    Lappe, M
    Holm, L
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 55 - 57
  • [7] Fahlman S. E., 1990, ADV NEURAL INFORMATI, P524, DOI DOI 10.1190/1.1821929
  • [8] Crystal structure of the hydrogenase maturating endopeptidase HYBD from Escherichia coli
    Fritsche, E
    Paschos, A
    Beisel, HG
    Böck, A
    Huber, R
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1999, 288 (05) : 989 - 998
  • [9] Mapping the protein universe
    Holm, L
    Sander, C
    [J]. SCIENCE, 1996, 273 (5275) : 595 - 602
  • [10] Holm L, 1998, PROTEINS, V33, P88, DOI 10.1002/(SICI)1097-0134(19981001)33:1<88::AID-PROT8>3.0.CO