Uneven distribution of GATC motifs in the Escherichia coli chromosome, its plasmids and its phages

被引:48
作者
Henaut, A
Rouxel, T
Gleizes, A
Moszer, I
Danchin, A
机构
[1] GENOMIC, F-74160 COLLONGES, FRANCE
[2] INST PASTEUR, F-75724 PARIS 15, FRANCE
关键词
tetranucleotides; statistics; genome; anaerobic growth; FNR;
D O I
10.1006/jmbi.1996.0186
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This work reconsiders the GATC motif distribution in a 1.6 Mb segment of the Escherichia coli genome, compared to its distribution in phages and plasmids. At first sight the distribution of GATC words looks random. But when a realistic model of the chromosome (made of average genes having the same codon usage as in the real chromosome), is used as a theoretical reference, strong biases are observed. GATC pairs such as GATCNNGATC are under-represented while there is a strong positive selection for motifs separated by 10, 19, 70 and 1100 bp. The last class is the only one present in E. coli parasites. It can be ascribed to the triggering sequences of the long-patch mismatch repair system. The 6 bp class overlaps with the consensus of CAP (catabolite activator protein) and FNR (fumarate/nitrate regulator) binding sites, thus accounting for counter-selection. The other classes, which could be targets for a nucleic acid binding protein, are almost always present inside protein coding sequences, and are members of clusters of GATC motifs. Analysis of the genes containing these motifs suggests that they correspond to a regulatory process monitoring the shift from anaerobic to aerobic growth conditions. In particular this regulation, closing down transcription of a large number of genes involved in intermediary metabolism would be well suited for the cold and oxygen shift from the mammal's gut tot he standard environmental conditions. In this process the methylation status of GATC clusters would be very important for tuning transcription, and a DNA binding protein, probably a member of the cold-shock proteins family would be needed or alleviating the effects mediated by slackening of the pace of methylation during the shift. (C) 1996 Academic Press Limited
引用
收藏
页码:574 / 585
页数:12
相关论文
共 45 条
[1]  
[Anonymous], UNIVERSAL TURING MAC
[2]   THE GREAT GATC - DNA METHYLATION IN ESCHERICHIA-COLI [J].
BARRAS, F ;
MARINUS, MG .
TRENDS IN GENETICS, 1989, 5 (05) :139-143
[3]  
BLATTNER FR, 1993, NUCLEIC ACIDS RES, V21, P5408
[4]   THE ESCHERICHIA-COLI REGULATORY PROTEIN OXYR DISCRIMINATES BETWEEN METHYLATED AND UNMETHYLATED STATES OF THE PHAGE MU-MOM PROMOTER [J].
BOLKER, M ;
KAHMANN, R .
EMBO JOURNAL, 1989, 8 (08) :2403-2410
[5]   QUANTITATION OF DAM METHYLTRANSFERASE IN ESCHERICHIA-COLI [J].
BOYE, E ;
MARINUS, MG ;
LOBNEROLESEN, A .
JOURNAL OF BACTERIOLOGY, 1992, 174 (05) :1682-1685
[6]   D(GATC) SEQUENCES INFLUENCE ESCHERICHIA-COLI MISMATCH REPAIR IN A DISTANCE-DEPENDENT MANNER FROM POSITIONS BOTH UPSTREAM AND DOWNSTREAM OF THE MISMATCH [J].
BRUNI, R ;
MARTIN, D ;
JIRICNY, J .
NUCLEIC ACIDS RESEARCH, 1988, 16 (11) :4875-4890
[7]   OVER-REPRESENTATION AND UNDER-REPRESENTATION OF SHORT OLIGONUCLEOTIDES IN DNA-SEQUENCES [J].
BURGE, C ;
CAMPBELL, AM ;
KARLIN, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (04) :1358-1362
[8]   DNA-SEQUENCE AND ANALYSIS OF 136 KILOBASES OF THE ESCHERICHIA-COLI GENOME - ORGANIZATIONAL SYMMETRY AROUND THE ORIGIN OF REPLICATION [J].
BURLAND, V ;
PLUNKETT, G ;
DANIELS, DL ;
BLATTNER, FR .
GENOMICS, 1993, 16 (03) :551-561
[9]   ANALYSIS OF THE ESCHERICHIA-COLI GENOME .6. DNA-SEQUENCE OF THE REGION FROM 92.8 THROUGH 100 MINUTES [J].
BURLAND, V ;
PLUNKETT, G ;
SOFIA, HJ ;
DANIELS, DL ;
BLATTNER, FR .
NUCLEIC ACIDS RESEARCH, 1995, 23 (12) :2105-2119
[10]   ESCHERICHIA-COLI ORIC AND THE DNAA GENE PROMOTER ARE SEQUESTERED FROM DAM METHYLTRANSFERASE FOLLOWING THE PASSAGE OF THE CHROMOSOMAL REPLICATION FORK [J].
CAMPBELL, JL ;
KLECKNER, N .
CELL, 1990, 62 (05) :967-979