Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution

被引:314
作者
Krylov, DM [1 ]
Wolf, YI [1 ]
Rogozin, IB [1 ]
Koonin, EV [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
关键词
GENOME EVOLUTION; ORIGIN; GENERATION; DATABASE;
D O I
10.1101/gr.1589103
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Lineage-specific gene loss, to a large extent, accounts for the differences in gene repertoires between genomes, particularly among eukaryotes. We derived a parsimonious scenario of gene losses for eukaryotic orthologous groups (KOGs) from seven complete eukaryotic genomes. The scenario involves substantial gene loss in fungi, nematodes, and insects. Based on this evolutionary scenario and estimates of the divergence times between major eukaryotic phyla, we introduce a numerical measure, the propensity for gene loss (PGL). We explore the connection among the propensity of a gene to be lost in evolution (PGL value), protein sequence divergence, the effect of gene knockout on fitness, the number of protein-protein interactions, and expression level for the genes in KOGs. Significant correlations between PGL and each of these variables were detected. Genes that have a lower propensity to be lost in eukaryotic evolution accumulate fewer substitutions in their protein sequences and tend to be essential for the organism viability, tend to be highly expressed, and have many interaction partners. The dependence between PGL and gene dispensability and interactivity is much stronger than that for sequence evolution rate. Thus, propensity of a gene to be lost during evolution seems to be a direct reflection of its biological importance.
引用
收藏
页码:2229 / 2235
页数:7
相关论文
共 43 条
  • [21] Do essential genes evolve slowly?
    Hurst, LD
    Smith, NGC
    [J]. CURRENT BIOLOGY, 1999, 9 (14) : 747 - 750
  • [22] Lethality and centrality in protein networks
    Jeong, H
    Mason, SP
    Barabási, AL
    Oltvai, ZN
    [J]. NATURE, 2001, 411 (6833) : 41 - 42
  • [23] THE RAPID GENERATION OF MUTATION DATA MATRICES FROM PROTEIN SEQUENCES
    JONES, DT
    TAYLOR, WR
    THORNTON, JM
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1992, 8 (03): : 275 - 282
  • [24] No simple dependence between protein evolution rate and the number of protein-protein interactions: only the most prolific interactors tend to evolve slowly
    Jordan, IK
    Wolf, YI
    Koonin, EV
    [J]. BMC EVOLUTIONARY BIOLOGY, 2003, 3 (1)
  • [25] Essential genes are more evolutionarily conserved than are nonessential genes in bacteria
    Jordan, IK
    Rogozin, IB
    Wolf, YI
    Koonin, EV
    [J]. GENOME RESEARCH, 2002, 12 (06) : 962 - 968
  • [26] Systematic functional analysis of the Caenorhabditis elegans genome using RNAi
    Kamath, RS
    Fraser, AG
    Dong, Y
    Poulin, G
    Durbin, R
    Gotta, M
    Kanapin, A
    Le Bot, N
    Moreno, S
    Sohrmann, M
    Welchman, DP
    Zipperlen, P
    Ahringer, J
    [J]. NATURE, 2003, 421 (6920) : 231 - 237
  • [27] Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi
    Katinka, MD
    Duprat, S
    Cornillot, E
    Méténier, G
    Thomarat, F
    Prensier, G
    Barbe, V
    Peyretaillade, E
    Brottier, P
    Wincker, P
    Delbac, F
    El Alaoui, H
    Peyret, P
    Saurin, W
    Gouy, M
    Weissenbach, J
    Vivarès, CP
    [J]. NATURE, 2001, 414 (6862) : 450 - 453
  • [28] Kimura M., 1983, The Neutral Theory of Molecular Evolution
  • [29] Complete genome sequences of cellular life forms: Glimpses of theoretical evolutionary genomics
    Koonin, EV
    Mushegian, AR
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 1996, 6 (06) : 757 - 762
  • [30] KUMAR S, 1994, COMPUT APPL BIOSCI, V10, P189