Separate base usages of genes located on the leading and lagging strands in Chlamydia muridarum revealed by the Z curve method

Cited by: 27
Authors
Guo, Feng-Biao [1]
Yu, Xiu-Juan
Affiliations
[1] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[2] Sichuan Univ, W China Med Sch, Dept Clin Med, Chengdu 610091, Peoples R China
Keywords
DOI
10.1186/1471-2164-8-366
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: The nucleotide compositional asymmetry between the leading and lagging strands of bacterial genomes has been studied intensively in recent years. Almost all bacterial genomes exhibit the same kind of base asymmetry. This work investigates the strand biases in the Chlamydia muridarum genome and shows the potential of the Z curve method for quantitatively differentiating genes on the leading and lagging strands.
Results: The occurrence frequencies of bases in the protein-coding genes of the C. muridarum genome were analyzed by the Z curve method. Genes located on the two replication strands were found to have distinct base usages. Based on gene positions in the 9-D space spanned by the variables u1 through u9 of the Z curve method, the K-means clustering algorithm assigns about 94% of genes to the correct strand, a few percent more than K-means classifies correctly when based on relative synonymous codon usage (RSCU). The base usage and codon usage analyses show that genes on the leading strand have more G than C and more T than A, particularly at the third codon position. For genes on the lagging strand the biases are reversed. The y component of the Z curves for the complete chromosome sequences shows that the excesses of G over C and of T over A are more pronounced in the C. muridarum genome than in other bacterial genomes that lack separate base and/or codon usages. Furthermore, the genomes of Borrelia burgdorferi, Treponema pallidum, Chlamydia muridarum and Chlamydia trachomatis, in which distinct base and/or codon usages have been observed, are phylogenetically closer to one another than to other bacterial genomes.
Conclusion: The nature of the strand biases of base composition in C. muridarum is similar to that in most other bacterial genomes. However, the base composition asymmetry between the leading and lagging strands is more pronounced in C. muridarum than in other bacteria. It is proposed that the strong strand biases of G/C and T/A are responsible for the appearance of separate base or codon usages in C. muridarum. In addition, the closer phylogenetic distance among the four bacterial genomes with separate base and/or codon usages appears to be inherent rather than coincidental. The results also suggest that the Z curve method may be more sensitive than RSCU for the quantitative analysis of DNA sequences.
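To make the 9-D representation mentioned in the abstract more concrete, the following is a minimal sketch of how nine Z curve variables could be computed for one coding sequence. It assumes the standard phase-specific Z curve definition (for each codon position: x = purine minus pyrimidine, y = amino minus keto, z = weak minus strong hydrogen bonding) and assumes that u1 through u9 are ordered as x1, y1, z1, x2, y2, z2, x3, y3, z3. The function name `z_curve_features` and the handling of ambiguous bases are illustrative choices, not taken from the paper.

```python
from collections import Counter


def z_curve_features(gene: str) -> list[float]:
    """Return [x1, y1, z1, x2, y2, z2, x3, y3, z3] for a coding sequence.

    Assumed ordering of u1..u9; hypothetical helper, not the authors' code.
    """
    gene = gene.upper()
    # Ignore a trailing partial codon, if any.
    usable = len(gene) - len(gene) % 3
    features = []
    for pos in range(3):
        # Bases at codon position pos + 1 (every third base).
        bases = gene[pos:usable:3]
        # Count only unambiguous bases; others (e.g. N) are skipped.
        counts = Counter(b for b in bases if b in "ACGT")
        total = sum(counts.values())
        if total == 0:
            raise ValueError(f"no unambiguous bases at codon position {pos + 1}")
        a, c, g, t = (counts[b] / total for b in "ACGT")
        features.extend([
            (a + g) - (c + t),  # x: purine vs. pyrimidine
            (a + c) - (g + t),  # y: amino vs. keto
            (a + t) - (g + c),  # z: weak vs. strong hydrogen bonding
        ])
    return features


if __name__ == "__main__":
    # Toy sequence for illustration only, not a real C. muridarum gene.
    print(z_curve_features("ATGGGTTTTGGT"))
```

Under these assumptions, an excess of G over C and of T over A both push the y components negative, which fits the abstract's statement that the y component captures the strand bias. Vectors of this kind, one per gene, could then be passed to any K-means implementation with two clusters.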
Pages: 1 / 8
Page count: 8