Analysis of codon usage patterns of bacterial genomes using the self-organizing map

Cited by: 38
Authors
Wang, HC
Badger, J
Kearney, P
Li, M
Affiliations
[1] Inst Basic Med Sci, Beijing 100850, Peoples R China
[2] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
Keywords
codon usage; self-organizing map; genome; gene function; horizontal gene transfer
DOI
10.1093/oxfordjournals.molbev.a003861
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Codon usage varies both between organisms and between different genes in the same organism. This observation has been used as a basis for earlier work in identifying highly expressed and horizontally transferred genes in Escherichia coli. In this work, we applied Kohonen's self-organizing map to the analysis of the codon usage patterns of the Escherichia coli, Aquifex aeolicus, Archaeoglobus fulgidus, Haemophilus influenzae Rd, Methanococcus jannaschii, Methanobacterium thermoautotrophicum, and Pyrococcus horikoshii genomes for evidence of highly expressed genes and horizontally transferred genes. All of the analyzed genomes had a clear category of horizontally transferred genes, and their apparent percentages ranged from 7.7% to 21.4%. The apparent percentage of highly expressed genes ranged from 0% to 11.8%. A clustering of average codon usage of main gene categories of the seven genomes showed an interesting mixing of gene classes in four thermophilic/hyperthermophilic organisms, A. aeolicus, A. fulgidus, M. thermoautotrophicum, and P. horikoshii, which suggests possible origins of their horizontally transferred genes as well as the need for adaptation to a specific environment. Further classification of the three gene categories in E. coli and H. influenzae according to gene function revealed that genes involved in communication (such as regulation and cell process) and structure (cell structure and structural proteins) are more likely to be horizontally transferred than are genes involved in information (transcription, translation, and related processes) and in some groups of energy (such as energy metabolism and carbon compound catabolism).
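The general idea in the abstract, representing each gene by its codon usage and letting a self-organizing map group similar genes, can be sketched as follows. This is only an illustration under stated assumptions, not the authors' procedure: the abstract does not give the codon usage measure, map size, or training schedule, so the plain relative frequencies over all 64 codons, the 2 x 2 grid, the linear decay, and the function names `codon_usage`, `best_matching_unit`, and `train_som` are all hypothetical choices.

```python
import math
import random
from collections import Counter
from itertools import product

# All 64 codons in a fixed order, so every gene maps to the same vector layout.
CODONS = ["".join(c) for c in product("ACGT", repeat=3)]


def codon_usage(seq):
    """Return a 64-dimensional vector of relative codon frequencies for one gene.

    Assumption: plain relative frequencies; the paper may use another measure.
    """
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    total = sum(counts[c] for c in CODONS)  # codons with ambiguous bases are skipped
    if total == 0:
        return [0.0] * len(CODONS)
    return [counts[c] / total for c in CODONS]


def best_matching_unit(weights, v):
    """Return the grid coordinate of the unit whose weights are closest to v."""
    return min(
        weights,
        key=lambda u: sum((a - b) ** 2 for a, b in zip(weights[u], v)),
    )


def train_som(vectors, rows=2, cols=2, epochs=50, lr0=0.5, radius0=1.0, seed=0):
    """Train a small Kohonen map; returns {(row, col): weight vector}.

    Assumption: linear decay of learning rate and neighbourhood radius.
    """
    rng = random.Random(seed)
    dim = len(vectors[0])
    # Initialise each map unit from a randomly chosen input vector.
    weights = {
        (r, c): list(rng.choice(vectors))
        for r in range(rows)
        for c in range(cols)
    }
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)
        radius = max(radius0 * (1.0 - frac), 0.01)
        # Present the genes in a fresh random order each epoch.
        for v in rng.sample(vectors, len(vectors)):
            bmu = best_matching_unit(weights, v)
            for unit, w in weights.items():
                d2 = (unit[0] - bmu[0]) ** 2 + (unit[1] - bmu[1]) ** 2
                h = math.exp(-d2 / (2.0 * radius ** 2))  # Gaussian neighbourhood
                for k in range(dim):
                    w[k] += lr * h * (v[k] - w[k])
    return weights
```

In use, one would compute `codon_usage` for every coding sequence in a genome, pass the resulting list to `train_som`, and then assign each gene to a map unit with `best_matching_unit`. Interpreting a cluster as highly expressed or horizontally transferred requires additional evidence that this sketch does not attempt.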
Pages: 792-800
Number of pages: 9