Investigations of oligonucleotide usage variance within and between prokaryotes

Cited by: 42
Authors
Bohlin, Jon [1 ]
Skjerve, Eystein [1 ]
Ussery, David W. [2 ]
Affiliations
[1] Norwegian Sch Vet Sci, Oslo, Norway
[2] Tech Univ Denmark, Dept Syst Biol, Ctr Biol Sequence Anal, Lyngby, Denmark
DOI
10.1371/journal.pcbi.1000057
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of different DNA 'word-sizes' and explore how oligonucleotide frequencies differ in coding and non-coding regions. In addition, we used oligonucleotide frequencies to investigate DNA composition and how DNA sequence patterns change within and between prokaryotic organisms. Among the results found was that prokaryotic chromosomes can be described by hexanucleotide frequencies, suggesting that prokaryotic DNA is predominantly short-range correlated, i.e., information in prokaryotic genomes is encoded in short oligonucleotides. Oligonucleotide usage varied more within AT-rich and host-associated genomes than in GC-rich and free-living genomes, and this variation was mainly located in non-coding regions. Bias (selectional pressure) in tetranucleotide usage correlated with GC content, and coding regions were more biased than non-coding regions. Non-coding regions were also found to be approximately 5.5% more AT-rich than coding regions, on average, in the 402 chromosomes examined. Pronounced DNA compositional differences were found both within and between AT-rich and GC-rich genomes. GC-rich genomes were more similar and biased in terms of tetranucleotide usage in non-coding regions than AT-rich genomes. The differences found between AT-rich and GC-rich genomes may possibly be attributed to lifestyle, since tetranucleotide usage within host-associated bacteria was, on average, more dissimilar and less biased than that of free-living archaea and bacteria.
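The quantities named in the abstract (oligonucleotide frequencies for a given word size, GC content, and the AT difference between coding and non-coding regions) can be illustrated with a minimal Python sketch. This is not the authors' method: the function names `kmer_frequencies`, `gc_content`, and `at_difference` are hypothetical, the sequences are toy strings, and the statistical measures of bias and variance used in the paper are not reproduced here.

```python
from collections import Counter

BASES = set("ACGT")


def kmer_frequencies(seq, k):
    """Relative frequencies of overlapping words of length k.

    Windows containing anything other than A, C, G, T are skipped.
    Hypothetical helper for illustration only.
    """
    seq = seq.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= BASES
    )
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()} if total else {}


def gc_content(seq):
    """Fraction of G and C among the unambiguous bases of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(b) for b in "ACGT")
    return (seq.count("G") + seq.count("C")) / acgt if acgt else 0.0


def at_difference(coding, noncoding):
    """AT content of non-coding minus coding, in percentage points.

    Since AT = 1 - GC, this equals GC(coding) - GC(noncoding).
    """
    return 100 * (gc_content(coding) - gc_content(noncoding))


# Toy example: tetranucleotide (k = 4) frequencies of a short sequence.
tetra = kmer_frequencies("ACGTACGT", 4)
```

With real data, `coding` and `noncoding` would be the concatenated coding and intergenic sequences of one chromosome, and `k` would range over the word sizes of interest (2 to 6 for the di- to hexanucleotides discussed above).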
Pages: 9