A structural census of genomes: Comparing bacterial, eukaryotic, and archaeal genomes in terms of protein structure

被引:118
作者
Gerstein, M [1 ]
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
关键词
D O I
10.1006/jmbi.1997.1412
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Representative genomes from each of the three kingdoms of life are compared in terms of protein structure, in particular, those of Haemophilus influenzae (a bacteria), Methanococcus jannaschii (an archaeon), and yeast (a eukaryote). The comparison is in the form of a census (or comprehensive accounting) of the relative occurrence of secondary and tertiary structures in the genomes, which particular emphasis on patterns of supersecondary structure. Comparison of secondary structure shows that the three genomes have nearly the same overall secondary-structure content, although they differ markedly in amino acid composition. Comparison of super-secondary structure, using a novel "frequent-words" approach, shows that yeast has a preponderance of consecutive strands (e.g. beta-beta-beta patterns), Haemophilus, consecutive helices (alpha-alpha-alpha), and Methanococcus, alternating helix-strand structures (beta-alpha-beta). Yeast also has significantly more helical membrane proteins than the other two genomes, with most of the differences concentrated in proteins containing two transmembrane segments. Comparison of tertiary structure (by sequence matching and domain-level clustering highlights the substantial duplication in each genome (approximate to 30% to 50%), with the degree of duplication following similar patterns in. all three. Many sequence families are shared among the genomes, with the degree of overlap between any two genomes being roughly similar. Ln total, the three genomes contain 148 of the approximate to 300 known protein folds. Forty-five of these 148 that are present in all three genomes are especially enriched in mixed super-secondary structures (alpha-beta). Moreover, the five most common of these 45 (the "top-5") have a remarkably similar super-secondary structure architecture containing a central sheet of parallel strands with helices packed onto at least one face and beta-alpha-beta connections between adjacent strands. These most basic molecular parts, which, presumably, were present in the last common ancestor to the three kingdoms, include the TIM-barrel, Rossmann, flavodoxin, thiamin-binding, and P-loop-hydrolase folds. (C) 1997 Academic Press Limited.
引用
收藏
页码:562 / 576
页数:15
相关论文
共 91 条
  • [1] ALTMAN R, 1994, P 2 INT C INT SYST M, P19
  • [2] ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES
    ALTSCHUL, SF
    BOGUSKI, MS
    GISH, W
    WOOTTON, JC
    [J]. NATURE GENETICS, 1994, 6 (02) : 119 - 129
  • [3] ARKIN I, 1997, IN PRESS PROTEINS ST
  • [4] UNDERLYING ORDER IN PROTEIN-SEQUENCE ORGANIZATION
    BERMAN, AL
    KOLKER, E
    TRIFONOV, EN
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (09) : 4044 - 4047
  • [5] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [6] Similarities and dissimilarities of phage genomes
    Blaisdell, BE
    Campbell, AM
    Karlin, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (12) : 5854 - 5859
  • [7] BLEASBY AJ, 1994, NUCLEIC ACIDS RES, V22, P3574
  • [8] COMPREHENSIVE SEQUENCE-ANALYSIS OF THE 182 PREDICTED OPEN READING FRAMES OF YEAST CHROMOSOME-III
    BORK, P
    OUZOUNIS, C
    SANDER, C
    SCHARF, M
    SCHNEIDER, R
    SONNHAMMER, E
    [J]. PROTEIN SCIENCE, 1992, 1 (12) : 1677 - 1690
  • [9] WHATS IN A GENOME
    BORK, P
    OUZOUNIS, C
    SANDER, C
    SCHARF, M
    SCHNEIDER, R
    SONNHAMMER, E
    [J]. NATURE, 1992, 358 (6384) : 287 - 287
  • [10] INVERTED PROTEIN-STRUCTURE PREDICTION
    BOWIE, JU
    EISENBERG, D
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1993, 3 (03) : 437 - 444