Anatomy of Escherichia coli ribosome binding sites

Cited by: 106
Authors
Shultzaberger, RK
Bucheimer, RE
Rudd, KE
Schneider, TD
Affiliations
[1] NCI, Lab Expt & Computat Biol, Frederick, MD 21702 USA
[2] Univ Maryland, College Pk, MD 20742 USA
[3] Univ Virginia, Sch Med, Charlottesville, VA 22908 USA
[4] Univ Miami, Sch Med, Dept Biochem & Mol Biol, Miami, FL 33101 USA
Keywords
ribosome; Shine-Dalgarno; information theory; sequence logo; sequence walker
DOI
10.1006/jmbi.2001.5040
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
During translational initiation in prokaryotes, the 3' end of the 16S rRNA binds to a region just upstream of the initiation codon. The relationship between this Shine-Dalgarno (SD) region and the binding of ribosomes to translation start-points has been well studied, but a unified mathematical connection between the SD, the initiation codon and the spacing between them has been lacking. Using information theory, we constructed a model that treats these three components uniformly by assigning to the SD and the initiation region (IR) conservations in bits of information, and by assigning to the spacing an uncertainty, also in bits. To build the model, we first aligned the SD region by maximizing the information content there. The ease of this process confirmed the existence of the SD pattern within a set of 4122 reviewed and revised Escherichia coli gene starts. This large data set allowed us to show graphically, by sequence logos, that the spacing between the SD and the initiation region affects both the SD site conservation and its pattern. We used the aligned SD, the spacing, and the initiation region to model ribosome binding and to identify gene starts that do not conform to the ribosome binding site model. A total of 569 experimentally proven starts are more conserved (have higher information content) than the full set of revised starts, which probably reflects an experimental bias against the detection of gene products that have inefficient ribosome binding sites. Models were refined cyclically by removing non-conforming weak sites. After this procedure, models derived from either the original or the revised gene start annotation were similar. Therefore, this information theory-based technique provides a method for easily constructing biologically sensible ribosome binding site models. Such models should be useful for refining gene-start predictions of any sequenced bacterial genome. (C) 2001 Academic Press.
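The abstract measures conservation at the SD and the initiation region in bits, and the spread of the spacing between them as an uncertainty, also in bits. The following is a minimal sketch of those two quantities, not the authors' implementation. It assumes a four-letter DNA alphabet, omits any small-sample correction, and uses hypothetical function names and toy input data chosen only for illustration.

```python
import math
from collections import Counter


def information_per_position(aligned_sites, alphabet="ACGT"):
    """Information content (bits) of each column in equal-length aligned sites.

    Assumption: information = log2(alphabet size) - Shannon entropy of the
    column, with no small-sample correction applied.
    """
    if not aligned_sites:
        raise ValueError("need at least one aligned site")
    length = len(aligned_sites[0])
    if any(len(site) != length for site in aligned_sites):
        raise ValueError("all aligned sites must have the same length")

    max_bits = math.log2(len(alphabet))
    bits = []
    for col in range(length):
        counts = Counter(site[col] for site in aligned_sites)
        total = sum(counts[base] for base in alphabet)
        if total == 0:
            raise ValueError(f"column {col} has no bases from the alphabet")
        entropy = 0.0
        for base in alphabet:
            if counts[base]:
                freq = counts[base] / total
                entropy -= freq * math.log2(freq)
        bits.append(max_bits - entropy)
    return bits


def spacing_uncertainty(distances):
    """Shannon entropy (bits) of an observed distribution of spacings."""
    if not distances:
        raise ValueError("need at least one spacing")
    counts = Counter(distances)
    total = len(distances)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


# Toy data, invented for illustration only (not taken from the paper).
toy_sites = ["AGGA", "AGGA", "AGGT", "TGGA"]
toy_spacings = [7, 7, 8, 9]

column_bits = information_per_position(toy_sites)
total_bits = sum(column_bits)
gap_bits = spacing_uncertainty(toy_spacings)
```

A fully conserved column contributes 2 bits, while a column with equal base frequencies contributes 0 bits. Summing `column_bits` gives the total conservation of the aligned region, which can be compared on the same scale with `gap_bits`.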
Pages: 215-228
Number of pages: 14
Related papers
60 in total
[31]   Discrimination by Escherichia coli initiation factor IF3 against initiation on non-canonical codons relies on complementarity rules [J].
Meinnel, T ;
Sacerdot, C ;
Graffe, M ;
Blanquet, S ;
Springer, M .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 290 (04) :825-837
[32]   The structural basis of ribosome activity in peptide bond synthesis [J].
Nissen, P ;
Hansen, J ;
Ban, N ;
Moore, PB ;
Steitz, TA .
SCIENCE, 2000, 289 (5481) :920-930
[33]   INFORMATION ANALYSIS OF SEQUENCES THAT BIND THE REPLICATION INITIATOR REPA [J].
PAPP, PP ;
CHATTORAJ, DK ;
SCHNEIDER, TD .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 233 (02) :219-230
[34]  
PIERCE JR, 1980, INTRO INFORMATION TH
[35]   TRANSLATION INITIATION IN ESCHERICHIA-COLI - SEQUENCES WITHIN THE RIBOSOME-BINDING SITE [J].
RINGQUIST, S ;
SHINEDLING, S ;
BARRICK, D ;
GREEN, L ;
BINKLEY, J ;
STORMO, GD ;
GOLD, L .
MOLECULAR MICROBIOLOGY, 1992, 6 (09) :1219-1229
[36]   CONTACTS BETWEEN 16S RIBOSOMAL-RNA AND MESSENGER-RNA, WITHIN THE SPACER REGION SEPARATING THE AUG INITIATOR CODON AND THE SHINE-DALGARNO SEQUENCE - A SITE-DIRECTED CROSS-LINKING STUDY [J].
RINKEAPPEL, J ;
JUNKE, N ;
BRIMACOMBE, R ;
LAVRIK, I ;
DOKUDOVSKAYA, S ;
DONTSOVA, O ;
BOGDANOV, A .
NUCLEIC ACIDS RESEARCH, 1994, 22 (15) :3018-3025
[37]  
Rogan PK, 1998, HUM MUTAT, V12, P153, DOI 10.1002/(SICI)1098-1004(1998)12:3<153::AID-HUMU3>3.3.CO;2-O
[39]   EcoGene: a genome sequence database for Escherichia coli K-12 [J].
Rudd, KE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :60-64
[40]  
RUDD KE, 1992, SHORT COURSE BACTERI