Operons in Escherichia coli:: Genomic analyses and predictions

被引:269
作者
Salgado, H
Moreno-Hagelsieb, G
Smith, TF
Collado-Vides, J
机构
[1] Univ Nacl Autonoma Mexico, Ctr Invest Fijac Nitrogeno, Cuernavaca 62100, Morelos, Mexico
[2] Boston Univ, Biomol Engn Res Ctr, Boston, MA 02115 USA
关键词
D O I
10.1073/pnas.110147297
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rich knowledge of operon organization in Escherichia coli, together with the completed chromosomal sequence of this bacterium, enabled us to perform an analysis of distances between genes and of functional relationships of adjacent genes in the same operon, as opposed to adjacent genes in different transcription units. We measured and demonstrated the expected tendencies of genes within operons to have much shorter intergenic distances than genes at the borders of transcription units. A clear peak at short distances between genes in the same operon contrasts with a flat frequency distribution of genes at the borders of transcription units. Also, genes in the same operon tend to have the same physiological functional class. The results of these analyses were used to implement a method to predict the genomic organization of genes into transcription units, The method has a maximum accuracy of 88% correct identification of pairs of adjacent genes to be in an operon, or at the borders of transcription units, and correctly identifies around 75% of the known transcription units when used to predict the transcription unit organization of the E, coli genome. Based on the frequency distance distributions, we estimated a total of 630 to 700 operons in E, coli, This step opens the possibility of predicting operon organization in other bacteria whose genome sequences have been finished.
引用
收藏
页码:6652 / 6657
页数:6
相关论文
共 26 条
  • [1] The complete genome sequence of Escherichia coli K-12
    Blattner, FR
    Plunkett, G
    Bloch, CA
    Perna, NT
    Burland, V
    Riley, M
    ColladoVides, J
    Glasner, JD
    Rode, CK
    Mayhew, GF
    Gregor, J
    Davis, NW
    Kirkpatrick, HA
    Goeden, MA
    Rose, DJ
    Mau, B
    Shao, Y
    [J]. SCIENCE, 1997, 277 (5331) : 1453 - +
  • [2] Conservation of gene order: a fingerprint of proteins that physically interact
    Dandekar, T
    Snel, B
    Huynen, M
    Bork, P
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) : 324 - 328
  • [3] DETERMINATION OF AN RNA STRUCTURE INVOLVED IN SPLICING INHIBITION OF A MUSCLE-SPECIFIC EXON
    DORVAL, BC
    DAUBENTONCARAFA, Y
    MARIE, J
    BRODY, E
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1991, 221 (03) : 837 - 856
  • [4] On the origin of operons and their possible role in evolution toward thermophily
    Glansdorff, N
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1999, 49 (04) : 432 - 438
  • [5] Hayes W S, 1998, Pac Symp Biocomput, P279
  • [6] Novel keto acid formate-lyase and propionate kinase enzymes are components of an anaerobic pathway in Escherichia coli that degrades L-threonine to propionate
    Hesslinger, C
    Fairhurst, SA
    Sawers, G
    [J]. MOLECULAR MICROBIOLOGY, 1998, 27 (02) : 477 - 492
  • [7] RegulonDB:: a database on transcriptional regulation in Escherichia coli
    Huerta, AM
    Salgado, H
    Thieffry, D
    Collado-Vides, J
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 55 - 59
  • [8] GENETIC REGULATORY MECHANISMS IN SYNTHESIS OF PROTEINS
    JACOB, F
    MONOD, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1961, 3 (03) : 318 - +
  • [9] Eco Cyc:: Encyclopedia of Escherichia coli genes and metabolism
    Karp, PD
    Riley, M
    Paley, SM
    Pellegrini-Toole, A
    Krummenacker, M
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 55 - 58
  • [10] Lawrence JG, 1996, GENETICS, V143, P1843