SIGNIFICANT DISPERSED RECURRENT DNA-SEQUENCES IN THE ESCHERICHIA-COLI GENOME - SEVERAL NEW GROUPS

被引:40
作者
BLAISDELL, BE
RUDD, KE
MATIN, A
KARLIN, S
机构
[1] NIH,NAT LIB MED,NATL CTR BIOTECHNOL INFORMAT,BETHESDA,MD 20892
[2] STANFORD UNIV,MED CTR,DEPT MICROBIOL & IMMUNOL,STANFORD,CA 94305
关键词
STATISTICALLY SIGNIFICANTLY LONG COMMON WORDS; ESCHERICHIA-COLI; PROTEIN BINDING TRANSPORT; RHO-INDEPENDENT TRANSCRIPTION TERMINATORS; SYSTEMS INDUCIBLE BY NUTRIENT STARVATION;
D O I
10.1006/jmbi.1993.1090
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
New computer and statistical methods were used to determined significant direct and inverted repeats in the Escherichia coli contig sequence collection of aggregate 1·6 × 106 base-pairs. Eight groups of mostly new structural repeat identities were uncovered. Apart from the high statistical significance of these repeat sequence, there are suggestive relationships of the group matches in terms of neighboring genes, of genomic distributions, of their texts, and of their potentials for secondary structure. Four of these groups are relatively numerous, 11 to 26 members, one is in coding sequences and three are in non-coding. The coding group consists of the ATP-activated transmembrane component of a typical high-affinity protein-binding transport system. One of the non-coding groups consists of a special rho-independent transcription termination signal closely following an processing RNA or DNA. A second non-coding group has, for one or both neighboring genes, a component of a system responding to stress or starvation for some nutrient. © 1993 Academic Press Limited.
引用
收藏
页码:833 / 848
页数:16
相关论文
共 55 条
[1]   ORGANIZATION AND NUCLEOTIDE-SEQUENCE OF A NEW RIBOSOMAL OPERON IN ESCHERICHIA-COLI CONTAINING THE GENES FOR RIBOSOMAL-PROTEIN S2 AND ELONGATION-FACTOR TS [J].
AN, G ;
BENDIAK, DS ;
MAMELAK, LA ;
FRIESEN, JD .
NUCLEIC ACIDS RESEARCH, 1981, 9 (16) :4163-4172
[2]   CODON RECOGNITION PATTERNS AS DEDUCED FROM SEQUENCES OF THE COMPLETE SET OF TRANSFER-RNA SPECIES IN MYCOPLASMA-CAPRICOLUM - RESEMBLANCE TO MITOCHONDRIA [J].
ANDACHI, Y ;
YAMAO, F ;
MUTO, A ;
OSAWA, S .
JOURNAL OF MOLECULAR BIOLOGY, 1989, 209 (01) :37-54
[3]  
[Anonymous], 1984, LARGE DEVIATIONS APP
[4]   THE ERDOS-RENYI LAW IN DISTRIBUTION, FOR COIN TOSSING AND SEQUENCE MATCHING [J].
ARRATIA, R ;
GORDON, L ;
WATERMAN, MS .
ANNALS OF STATISTICS, 1990, 18 (02) :539-570
[5]   CRITICAL PHENOMENA IN SEQUENCE MATCHING [J].
ARRATIA, R ;
WATERMAN, MS .
ANNALS OF PROBABILITY, 1985, 13 (04) :1236-1249
[6]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2247-2248
[7]  
BLASBAND AJ, 1986, J BIOL CHEM, V26, P12723
[8]   PROSET - A FAST PROCEDURE TO CREATE NONREDUNDANT SETS OF PROTEIN SEQUENCES [J].
BRENDEL, V .
MATHEMATICAL AND COMPUTER MODELLING, 1992, 16 (6-7) :37-43
[9]   OVER-REPRESENTATION AND UNDER-REPRESENTATION OF SHORT OLIGONUCLEOTIDES IN DNA-SEQUENCES [J].
BURGE, C ;
CAMPBELL, AM ;
KARLIN, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (04) :1358-1362
[10]   ANALYSIS OF THE ESCHERICHIA-COLI GENOME - DNA-SEQUENCE OF THE REGION FROM 84.5 TO 86.5 MINUTES [J].
DANIELS, DL ;
PLUNKETT, G ;
BURLAND, V ;
BLATTNER, FR .
SCIENCE, 1992, 257 (5071) :771-778