Analysis of domain structural class using an automated class assignment protocol

被引:51
作者
Michie, AD [1 ]
Orengo, CA [1 ]
Thornton, JM [1 ]
机构
[1] UNIV LONDON UNIV COLL,DEPT BIOCHEM & MOL BIOL,BIOMOL STRUCT & MODELLING UNIT,LONDON WC1E 6BT,ENGLAND
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
protein domain; structural class; algorithm definition; automatic assignment;
D O I
10.1006/jmbi.1996.0506
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The extent to which the contemporary dataset of protein structures can be segregated into four structural ''classes'' as originally defined by Levitt & Chothia in 1976 is examined and a simple method presented for the assignment of protein domains into these classes. Assignments are based on known three-dimensional structures, and for successful assignment it was found that helix/sheet content, contacts between secondary structures and their sequential order had to be used. The procedure attempts to maximise the automatic separation into classes for a dataset of 197 manually classified, non-homologous domains. It was found that approximately 90% of the structures were classified automatically; the remainder were borderline and were left for manual inspection. The method was then applied to a test set of 43 protein domains with similar results. The data support the concept of distinct classes of protein structure, although a few intermediate structures are found, demonstrating that it is possible to define relatively simple parameters complying with commonly accepted nomenclature that automatically define 90% of protein domains with essentially 100% accuracy. However, re-examination of the data also suggested that the previously separate alpha/beta and alpha + beta classes show considerable overlap and are more naturally represented as a single alpha beta class. This large alpha beta class can then be most easily subdivided by consideration of whether the sheets are mainly parallel, antiparallel or mixed. The correlation between structural class and function is discussed, together with the conservation of class within a sequence superfamily. This represents the first step in an automated phenetic description of protein structure complementing the usual phylogenetic approach to protein structure classification. (C) 1996 Academic Press Limited
引用
收藏
页码:168 / 185
页数:18
相关论文
共 37 条
  • [1] THE ENZYME DATA-BANK
    BAIROCH, A
    [J]. NUCLEIC ACIDS RESEARCH, 1993, 21 (13) : 3155 - 3156
  • [2] BAIROCH A, 1994, NUCLEIC ACIDS RES, V22, P3583
  • [3] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [4] BIELKA H, 1984, ENZYME NOMENCLATURE
  • [5] ACCURATE PREDICTION OF PROTEIN SECONDARY STRUCTURAL CLASS WITH FUZZY STRUCTURAL VECTORS
    BOBERG, J
    SALAKOSKI, T
    VIHINEN, M
    [J]. PROTEIN ENGINEERING, 1995, 8 (06): : 505 - 512
  • [6] CHANDONIA JM, 1995, PROTEIN SCI, V4, P275
  • [7] CRYSTAL-STRUCTURE OF THE RIBONUCLEASE-H DOMAIN OF HIV-1 REVERSE-TRANSCRIPTASE
    DAVIES, JF
    HOSTOMSKA, Z
    HOSTOMSKY, Z
    JORDAN, SR
    MATTHEWS, DA
    [J]. SCIENCE, 1991, 252 (5002) : 88 - 95
  • [8] DELEAGE G, 1989, PREDICTION PROTEIN S, P587
  • [9] PREDICTION OF PROTEIN FOLDING CLASS FROM AMINO-ACID-COMPOSITION
    DUBCHAK, I
    HOLBROOK, SR
    KIM, SH
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1993, 16 (01): : 79 - 91
  • [10] ELEENBERGER T, 1992, CELL, V71, P1223