Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution

Times cited: 314
Authors
Krylov, DM [1 ]
Wolf, YI [1 ]
Rogozin, IB [1 ]
Koonin, EV [1 ]
Affiliations
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Keywords
GENOME EVOLUTION; ORIGIN; GENERATION; DATABASE
DOI
10.1101/gr.1589103
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Lineage-specific gene loss accounts, to a large extent, for the differences in gene repertoires between genomes, particularly among eukaryotes. We derived a parsimonious scenario of gene losses for eukaryotic orthologous groups (KOGs) from seven complete eukaryotic genomes. The scenario involves substantial gene loss in fungi, nematodes, and insects. Based on this evolutionary scenario and estimates of the divergence times between major eukaryotic phyla, we introduce a numerical measure, the propensity for gene loss (PGL). We explore the connections among the propensity of a gene to be lost in evolution (PGL value), protein sequence divergence, the effect of gene knockout on fitness, the number of protein-protein interactions, and expression level for the genes in KOGs. Significant correlations between PGL and each of these variables were detected. Genes that have a lower propensity to be lost in eukaryotic evolution accumulate fewer substitutions in their protein sequences, tend to be essential for organism viability, tend to be highly expressed, and have many interaction partners. The association of PGL with gene dispensability and interactivity is much stronger than its association with sequence evolution rate. Thus, the propensity of a gene to be lost during evolution seems to be a direct reflection of its biological importance.
Pages: 2229-2235
Number of pages: 7