Measuring genome evolution

被引:329
作者
Huynen, MA
Bork, P
机构
[1] European Mol Biol Lab, D-69012 Heidelberg, Germany
[2] Max Delbruck Ctr Mol Med, D-13122 Berlin, Germany
关键词
ortholog; synteny; computer analysis; horizontal gene transfer;
D O I
10.1073/pnas.95.11.5849
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The determination of complete genome sequences provides us with an opportunity to describe and analyze evolution at the comprehensive level of genomes. Here we compare nine genomes with respect to their protein coding genes at two levels: (i) we compare genomes as "bags of genes" and measure the fraction of orthologs shared between genomes and (ii) we quantify correlations between genes with respect to their relative positions in genomes. Distances between the genomes are related to their divergence times, measured as the number of amino acid substitutions per site in a set of 34 orthologous genes that are shared among all the genomes compared. We establish a hierarchy of rates at which genomes have changed during evolution. Protein sequence identity is the most conserved, followed by the complement of genes within the genome. Next is the degree of conservation of the order of genes, whereas gene regulation appears to evolve at the highest rate. Finally, we show that some genomes are more highly organized than others: they show a higher degree of the clustering of genes that have orthologs in other genomes.
引用
收藏
页码:5849 / 5856
页数:8
相关论文
共 47 条
  • [11] Fisher R. A., 1999, The Genetical Theory of Natural Selection: A Complete Variorum Edition
  • [12] DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS
    FITCH, WM
    [J]. SYSTEMATIC ZOOLOGY, 1970, 19 (02): : 99 - &
  • [13] WHOLE-GENOME RANDOM SEQUENCING AND ASSEMBLY OF HAEMOPHILUS-INFLUENZAE RD
    FLEISCHMANN, RD
    ADAMS, MD
    WHITE, O
    CLAYTON, RA
    KIRKNESS, EF
    KERLAVAGE, AR
    BULT, CJ
    TOMB, JF
    DOUGHERTY, BA
    MERRICK, JM
    MCKENNEY, K
    SUTTON, G
    FITZHUGH, W
    FIELDS, C
    GOCAYNE, JD
    SCOTT, J
    SHIRLEY, R
    LIU, LI
    GLODEK, A
    KELLEY, JM
    WEIDMAN, JF
    PHILLIPS, CA
    SPRIGGS, T
    HEDBLOM, E
    COTTON, MD
    UTTERBACK, TR
    HANNA, MC
    NGUYEN, DT
    SAUDEK, DM
    BRANDON, RC
    FINE, LD
    FRITCHMAN, JL
    FUHRMANN, JL
    GEOGHAGEN, NSM
    GNEHM, CL
    MCDONALD, LA
    SMALL, KV
    FRASER, CM
    SMITH, HO
    VENTER, JC
    [J]. SCIENCE, 1995, 269 (5223) : 496 - 512
  • [14] THE MINIMAL GENE COMPLEMENT OF MYCOPLASMA-GENITALIUM
    FRASER, CM
    GOCAYNE, JD
    WHITE, O
    ADAMS, MD
    CLAYTON, RA
    FLEISCHMANN, RD
    BULT, CJ
    KERLAVAGE, AR
    SUTTON, G
    KELLEY, JM
    FRITCHMAN, JL
    WEIDMAN, JF
    SMALL, KV
    SANDUSKY, M
    FUHRMANN, J
    NGUYEN, D
    UTTERBACK, TR
    SAUDEK, DM
    PHILLIPS, CA
    MERRICK, JM
    TOMB, JF
    DOUGHERTY, BA
    BOTT, KF
    HU, PC
    LUCIER, TS
    PETERSON, SN
    SMITH, HO
    HUTCHISON, CA
    VENTER, JC
    [J]. SCIENCE, 1995, 270 (5235) : 397 - 403
  • [15] Avoidance of palindromic words in bacterial and archaeal genomes: A close connection with restriction enzymes
    Gelfand, MS
    Koonin, EV
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (12) : 2430 - 2439
  • [16] GRISHIN NV, 1995, J MOL EVOL, V41, P675
  • [17] IDENTIFYING CONSTRAINTS ON THE HIGHER-ORDER STRUCTURE OF RNA - CONTINUED DEVELOPMENT AND APPLICATION OF COMPARATIVE SEQUENCE-ANALYSIS METHODS
    GUTELL, RR
    POWER, A
    HERTZ, GZ
    PUTZ, EJ
    STORMO, GD
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (21) : 5785 - 5795
  • [18] Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae
    Himmelreich, R
    Hilbert, H
    Plagens, H
    Pirkl, E
    Li, BC
    Herrmann, R
    [J]. NUCLEIC ACIDS RESEARCH, 1996, 24 (22) : 4420 - 4449
  • [19] DNA repeats identify novel virulence genes in Haemophilus influenzae
    Hood, DW
    Deadman, ME
    Jennings, MP
    Bisercic, M
    Fleischmann, RC
    Venter, JC
    Moxon, ER
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (20) : 11121 - 11125
  • [20] Huynen MA, 1997, TRENDS GENET, V13, P389