The relationship between domain duplication and recombination

Cited by: 136
Authors
Vogel, C [1]
Teichmann, SA [1]
Pereira-Leal, J [1]
Affiliations
[1] MRC, Mol Biol Lab, Cambridge CB2 2QH, England
Keywords
domain recombination; neutral evolution; domain versatility; domain shuffling
DOI
10.1016/j.jmb.2004.11.050
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Protein domains represent the basic evolutionary units that form proteins. Domain duplication and shuffling by recombination are probably the most important forces driving protein evolution and hence the complexity of the proteome. While the duplication of whole genes as well as domain-encoding exons increases the abundance of domains in the proteome, domain shuffling increases versatility, i.e. the number of distinct contexts in which a domain can occur. Here, we describe a comprehensive, genome-wide analysis of the relationship between these two processes. We observe a strong and robust correlation between domain versatility and abundance: domains that occur more often also have many different combination partners. This supports the view that domain recombination occurs in a random way. However, we do not observe all the different combinations that are expected from a simple random recombination scenario, and this is due to frequent duplication of specific domain combinations. When we simulate the evolution of the protein repertoire considering stochastic recombination of domains followed by extensive duplication of the combinations, we approximate the observed data well. Our analyses are consistent with a stochastic process that governs domain recombination and thus protein divergence with respect to domains within a polypeptide chain. At the same time, they support a scenario in which domain combinations are formed only once during the evolution of the protein repertoire, and are then duplicated to various extents. The extent of duplication of different combinations varies widely and, in nature, will depend on selection for the domain combination based on its function. Some of the pair-wise domain combinations that are highly duplicated also recur frequently with other partner domains, and thus represent evolutionary units larger than single protein domains, which we term "supra-domains". © 2004 Elsevier Ltd. All rights reserved.
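The two quantities contrasted in the abstract, and the "recombine once, then duplicate" scenario, can be illustrated with a small toy model. The sketch below is not the authors' simulation: the function names, the use of two-domain proteins only, the skewed sampling weights, and the uniform copy-number range are all assumptions made for illustration. Versatility is counted here as the number of distinct adjacent partner domains, which is one plausible reading of "distinct contexts".

```python
import random
from collections import Counter, defaultdict


def abundance_and_versatility(proteins):
    """Count domain abundance and versatility.

    `proteins` is a list of tuples of domain names in chain order.
    Abundance = number of occurrences of a domain.
    Versatility (assumed definition) = number of distinct adjacent partners.
    """
    abundance = Counter()
    partners = defaultdict(set)
    for domains in proteins:
        abundance.update(domains)
        # Each adjacent pair makes the two domains partners of each other.
        for left, right in zip(domains, domains[1:]):
            partners[left].add(right)
            partners[right].add(left)
    versatility = {d: len(partners[d]) for d in abundance}
    return abundance, versatility


def simulate(n_domains=50, n_combinations=200, max_copies=20, seed=0):
    """Toy model: each ordered domain pair is formed at most once by random
    recombination, then duplicated a random number of times.

    All parameters are hypothetical; real copy numbers would depend on
    selection for the function of each combination.
    """
    rng = random.Random(seed)
    domains = [f"d{i}" for i in range(n_domains)]
    # Assumed skew: some domains are more likely to be drawn than others.
    weights = [1.0 / (i + 1) for i in range(n_domains)]
    seen = set()
    proteins = []
    while len(seen) < n_combinations:
        a, b = rng.choices(domains, weights=weights, k=2)
        if (a, b) in seen:
            continue  # a combination is formed only once
        seen.add((a, b))
        copies = rng.randint(1, max_copies)  # extent of duplication varies
        proteins.extend([(a, b)] * copies)
    return proteins


if __name__ == "__main__":
    abundance, versatility = abundance_and_versatility(simulate())
    for d, n in abundance.most_common(5):
        print(d, "abundance:", n, "versatility:", versatility[d])
```

In this toy setting, domains drawn more often tend to acquire both more copies and more partners, which is the qualitative pattern the abstract reports; the sketch makes no claim about matching the observed data quantitatively.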
Pages: 355-365
Number of pages: 11