Compensation for nucleotide bias in a genome by representation as a discrete channel with noise

被引:11
作者
Schreiber, M [1 ]
Brown, C [1 ]
机构
[1] Univ Otago, Dept Biochem, Dunedin, New Zealand
关键词
D O I
10.1093/bioinformatics/18.4.507
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Calculation of the information content of motifs in genomes highly biased in nucleotide composition is likely to lead to overestimates of the amount of useful information in the motif. Calculating relative information can compensate for biases, however the resulting information content is the amount seen by an observer and not by a macromolecule binding to the motif. The latter is needed to calculate the discriminatory power of the motif and to compare motifs between species. Results: By treating a biased genome as a discrete channel with noise, in accordance with Shannon Information Theory, we were able to remove both 'Distortion' and 'Noise' from the motif and recover a more instructive biological 'signal.' A Java application, LogoPaint, was developed to remove nucleotide bias distortion and triplet frequency noise from motifs, calculate information content and present the motif as a logo. We demonstrate how this technique can 'unmask' motifs in the translation initiation regions of bacteria that are obscured by strong sequence biases. Availability: LogoPaint is available to all users from the authors as an executable JAR file. Source code is available by arrangement. Contact: chris.brown@stonebow.otago.ac.nz.
引用
收藏
页码:507 / 512
页数:6
相关论文
共 17 条
[1]   CODON-ANTICODON PAIRING - WOBBLE HYPOTHESIS [J].
CRICK, FHC .
JOURNAL OF MOLECULAR BIOLOGY, 1966, 19 (02) :548-&
[2]   Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi [J].
Fraser, CM ;
Casjens, S ;
Huang, WM ;
Sutton, GG ;
Clayton, R ;
Lathigra, R ;
White, O ;
Ketchum, KA ;
Dodson, R ;
Hickey, EK ;
Gwinn, M ;
Dougherty, B ;
Tomb, JF ;
Fleischmann, RD ;
Richardson, D ;
Peterson, J ;
Kerlavage, AR ;
Quackenbush, J ;
Salzberg, S ;
Hanson, M ;
vanVugt, R ;
Palmer, N ;
Adams, MD ;
Gocayne, J ;
Weidman, J ;
Utterback, T ;
Watthey, L ;
McDonald, L ;
Artiach, P ;
Bowman, C ;
Garland, S ;
Fujii, C ;
Cotton, MD ;
Horst, K ;
Roberts, K ;
Hatch, B ;
Smith, HO ;
Venter, JC .
NATURE, 1997, 390 (6660) :580-586
[3]   Transterm: a database of messenger RNA components and signals [J].
Jacobs, GH ;
Stockwell, PA ;
Schrieber, MJ ;
Tate, WP ;
Brown, CM .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :293-295
[4]   Codon usages in different gene classes of the Escherichia coli genome [J].
Karlin, S ;
Mrázek, J ;
Campbell, AM .
MOLECULAR MICROBIOLOGY, 1998, 29 (06) :1341-1355
[5]   The Comprehensive Microbial Resource [J].
Peterson, JD ;
Umayam, LA ;
Dickinson, T ;
Hickey, EK ;
White, O .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :123-125
[6]   The role of the AUU initiation codon in the negative feedback regulation of the gene for translation initiation factor IF3 in Escherichia coli [J].
Sacerdot, C ;
Chiaruttini, C ;
Engst, K ;
Graffe, M ;
Milet, M ;
Mathy, N ;
Dondon, J ;
Springer, M .
MOLECULAR MICROBIOLOGY, 1996, 21 (02) :331-346
[7]   INFORMATION-CONTENT OF BINDING-SITES ON NUCLEOTIDE-SEQUENCES [J].
SCHNEIDER, TD ;
STORMO, GD ;
GOLD, L ;
EHRENFEUCHT, A .
JOURNAL OF MOLECULAR BIOLOGY, 1986, 188 (03) :415-431
[8]   THEORY OF MOLECULAR MACHINES .2. ENERGY-DISSIPATION FROM MOLECULAR MACHINES [J].
SCHNEIDER, TD .
JOURNAL OF THEORETICAL BIOLOGY, 1991, 148 (01) :125-137
[9]   Measuring molecular information [J].
Schneider, TD .
JOURNAL OF THEORETICAL BIOLOGY, 1999, 201 (01) :87-92
[10]   THEORY OF MOLECULAR MACHINES .1. CHANNEL CAPACITY OF MOLECULAR MACHINES [J].
SCHNEIDER, TD .
JOURNAL OF THEORETICAL BIOLOGY, 1991, 148 (01) :83-123