Touring protein fold space with Dali/FSSP

被引:571
作者
Holm, L [1 ]
Sander, C [1 ]
机构
[1] European Mol Biol Lab, European Bioinformat Inst, Cambridge CB10 1SD, England
关键词
D O I
10.1093/nar/26.1.316
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The FSSP database and its new supplement, the Dali Domain Dictionary, present a continuously updated classification of all known 3D protein structures, The classification is derived using an automatic structure alignment program (Dali) for the all-against-all comparison of structures in the Protein Data Bank. From the resulting enumeration of structural neighbours (which form a surprisingly continuous distribution in fold space) we derive a discrete fold classification in three steps: (i) sequence-related families are covered by a representative set of protein chains; (ii) protein chains are decomposed into structural domains based on the recurrence of structural motifs; (iii) folds are defined as tight clusters of domains in fold space, The fold classification, domain definitions and test sets for sequence-structure alignment (threading) are accessible on the web at www.embl-ebi.ac.uk/dali. The web interface provides a rich network of links between neighbours in fold space, between domains and proteins, and between structures and sequences leading, for example, to a database of explicit multiple alignments of protein families in the twilight zone of sequence similarity, The Dali/FSSP organization of protein structures provides a map of the currently known regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination.
引用
收藏
页码:316 / 319
页数:4
相关论文
共 18 条
  • [1] PROSITE - A DICTIONARY OF SITES AND PATTERNS IN PROTEINS
    BAIROCH, A
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 : 2013 - 2018
  • [2] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [3] Etzold T, 1996, METHOD ENZYMOL, V266, P114
  • [4] Surprising similarities in structure comparison
    Gibrat, JF
    Madej, T
    Bryant, SH
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) : 377 - 385
  • [5] Mapping the protein universe
    Holm, L
    Sander, C
    [J]. SCIENCE, 1996, 273 (5275) : 595 - 602
  • [6] Enzyme HIT
    Holm, L
    Sander, C
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1997, 22 (04) : 116 - 117
  • [7] A DATABASE OF PROTEIN-STRUCTURE FAMILIES WITH COMMON FOLDING MOTIFS
    HOLM, L
    OUZOUNIS, C
    SANDER, C
    TUPAREV, G
    VRIEND, G
    [J]. PROTEIN SCIENCE, 1992, 1 (12) : 1691 - 1698
  • [8] Holm L, 1997, PROTEINS, V28, P72, DOI 10.1002/(SICI)1097-0134(199705)28:1<72::AID-PROT7>3.0.CO
  • [9] 2-L
  • [10] HOLM L, 1997, INTEL SYS MOL BIOL, V5, P140