Genomic and structural aspects of protein evolution

被引:75
作者
Chothia, Cyrus [1 ]
Gough, Julian
机构
[1] MRC, Mol Biol Lab, Cambridge CB2 0QH, England
关键词
biological complexity; domain duplication; domain superfamily; evolution; genome; protein structure; CAENORHABDITIS-ELEGANS; GENE DUPLICATIONS; SEQUENCE; FAMILY; BACTERIAL; ENZYME; SUPERFAMILIES; STABILITY; FOLD; DETERMINANTS;
D O I
10.1042/BJ20090122
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It has been known for more than 35 years that, during evolution, new proteins ire formed by gene duplications, sequence and structural divergence and, in many cases, gene combinations. The genome projects have produced complete, or almost complete, descriptions of file protein repertoires of over 600 distinct organisms. Analyses of these data have dramatically increased our understanding of the formation of new proteins, At the present time, we can accurately trace the evolutionary relationships of about half the proteins found in most genomes, and it is these proteins that we discuss in the present review. Usually. the units of evolution are protein domain,, that are duplicated, diverge and form combinations. Small proteins contain one domain, and large proteins contain combinations of two or more domains, Domains descended from a common ancestor are clustered into superfamilies. In most genomes, the net growth of superfamily members means that more than 90% of domains are duplicates. In a section on domain duplications, we discuss the number of currently known superfamilies, their size and distribution, and superfamily expansions related to biological complexity and to specific lineages. In a section on divergence, we describe how sequences and structures diverge, the changes in stability produced by acceptable mutations, and the nature of functional divergence and selection. In a section oil domain combinations, we discuss their general nature, the sequential order of domains, how combinations modify function, and the extraordinary variety of the domain combinations found in different genomes. We conclude with a brief note on other forms of protein evolution and speculations of the origins of the duplication, divergence and combination processes.
引用
收藏
页码:15 / 28
页数:14
相关论文
共 84 条
  • [31] Fold change in evolution of protein structures
    Grishin, NV
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2001, 134 (2-3) : 167 - 185
  • [32] Conservation of folding and stability within a protein family: The tyrosine corner as an evolutionary cul-de-sac
    Hamill, SJ
    Cota, E
    Chothia, C
    Clarke, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 295 (03) : 641 - 649
  • [33] Cadherin superfamily proteins in Caenorhabditis elegans and Drosophila melanogaster
    Hill, E
    Broadbent, ID
    Chothia, C
    Pettitt, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 305 (05) : 1011 - 1024
  • [34] The frequency distribution of gene family sizes in complete genomes
    Huynen, MA
    van Nimwegen, E
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (05) : 583 - 589
  • [35] ENZYME RECRUITMENT IN EVOLUTION OF NEW FUNCTION
    JENSEN, RA
    [J]. ANNUAL REVIEW OF MICROBIOLOGY, 1976, 30 : 409 - 425
  • [36] Lineage-specific gene expansions in bacterial and archaeal genomes
    Jordan, IK
    Makarova, KS
    Spouge, JL
    Wolf, YI
    Koonin, EV
    [J]. GENOME RESEARCH, 2001, 11 (04) : 555 - 565
  • [37] Origin and evolution of eukaryotic apoptosis: the bacterial connection
    Koonin, EV
    Aravind, L
    [J]. CELL DEATH AND DIFFERENTIATION, 2002, 9 (04) : 394 - 404
  • [38] The impact of comparative genomics on our understanding of evolution
    Koonin, EV
    Aravind, L
    Kondrashov, AS
    [J]. CELL, 2000, 101 (06) : 573 - 576
  • [39] The structure of the protein universe and genome evolution
    Koonin, EV
    Wolf, YI
    Karev, GP
    [J]. NATURE, 2002, 420 (6912) : 218 - 223
  • [40] Alternative splicing and gene duplication are inversely correlated evolutionary mechanisms
    Kopelman, NM
    Lancet, D
    Yanai, I
    [J]. NATURE GENETICS, 2005, 37 (06) : 588 - 589