CATH - a hierarchic classification of protein domain structures

被引:1916
作者
Orengo, CA
Michie, AD
Jones, S
Jones, DT
Swindells, MB
Thornton, JM
机构
[1] UNIV LONDON UNIV COLL,DEPT BIOCHEM & MOL BIOL,BIOMOL STRUCT & MODELLING UNIT,LONDON WC1E 6BT,ENGLAND
[2] UNIV WARWICK,DEPT BIOL SCI,COVENTRY CV4 7AL,W MIDLANDS,ENGLAND
[3] UNIV LONDON BIRKBECK COLL,DEPT CRYSTALLOG,LONDON WC1E 6BT,ENGLAND
关键词
evolution; fold families; protein structure classification;
D O I
10.1016/S0969-2126(97)00260-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Protein evolution gives rise to families of structurally related proteins, within which sequence identities can be extremely low. As a result, structure-based classifications can be effective at identifying unanticipated relationships in known structures and in optimal cases function can also be assigned. The ever increasing number of known protein structures is too large to classify all proteins manually, therefore, automatic methods are needed for fast evaluation of protein structures. Results: We present a semi-automatic procedure for deriving a novel hierarchical classification of protein domain structures (CATH). The four main levels of our classification are protein class (C), architecture (A), topology (T) and homologous superfamily (H). Class is the simplest level, and it essentially describes the secondary structure composition of each domain. In contrast, architecture summarises the shape revealed by the orientations of the secondary structure units, such as barrels and sandwiches. At the topology level, sequential connectivity is considered, such that members of the same architecture might have quite different topologies. When structures belonging to the same T-level have suitably high similarities combined with similar functions, the proteins are assumed to be evolutionarily related and put into the same homologous superfamily. Conclusions: Analysis of the structural families generated by CATH reveals the prominent features of protein structure space. We find that nearly a third of the homologous superfamilies (H-levels) belong to ten major T-levels, which we call superfolds, and furthermore that nearly two-thirds of these H-levels cluster into nine simple architectures. A database of well-characterised protein structure families, such as CATH, will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.
引用
收藏
页码:1093 / 1108
页数:16
相关论文
共 43 条
  • [21] Analysis of domain structural class using an automated class assignment protocol
    Michie, AD
    Orengo, CA
    Thornton, JM
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1996, 262 (02) : 168 - 185
  • [22] STRUCTURAL PRINCIPLES FOR THE PROPELLER ASSEMBLY OF BETA-SHEETS - THE PREFERENCE FOR 7-FOLD SYMMETRY
    MURZIN, AG
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1992, 14 (02): : 191 - 201
  • [23] OB(OLIGONUCLEOTIDE OLIGOSACCHARIDE BINDING)-FOLD - COMMON STRUCTURAL AND FUNCTIONAL SOLUTION FOR NONHOMOLOGOUS SEQUENCES
    MURZIN, AG
    [J]. EMBO JOURNAL, 1993, 12 (03) : 861 - 867
  • [24] MURZIN AG, 1995, J MOL BIOL, V247, P536, DOI 10.1016/S0022-2836(05)80134-2
  • [25] A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS
    NEEDLEMAN, SB
    WUNSCH, CD
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) : 443 - +
  • [26] CLASSIFICATION OF PROTEIN FOLDS
    ORENGO, C
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1994, 4 (03) : 429 - 440
  • [27] FAST STRUCTURE ALIGNMENT FOR PROTEIN DATA-BANK SEARCHING
    ORENGO, CA
    BROWN, NP
    TAYLOR, WR
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1992, 14 (02) : 139 - 167
  • [28] IDENTIFICATION AND CLASSIFICATION OF PROTEIN FOLD FAMILIES
    ORENGO, CA
    FLORES, TP
    TAYLOR, WR
    THORNTON, JM
    [J]. PROTEIN ENGINEERING, 1993, 6 (05): : 485 - 500
  • [29] PROTEIN SUPERFAMILIES AND DOMAIN SUPERFOLDS
    ORENGO, CA
    JONES, DT
    THORNTON, JM
    [J]. NATURE, 1994, 372 (6507) : 631 - 634
  • [30] ALPHA-FOLD PLUS BETA-FOLD REVISITED - SOME FAVORED MOTIFS
    ORENGO, CA
    THORNTON, JM
    [J]. STRUCTURE, 1993, 1 (02) : 105 - 120