EcoGene:: a genome sequence database for Escherichia coli K-12

被引:196
作者
Rudd, KE [1 ]
机构
[1] Univ Miami, Sch Med, Dept Biochem & Mol Biol, Miami, FL 33101 USA
关键词
D O I
10.1093/nar/28.1.60
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The EcoGene database provides a set of gene and protein sequences derived from the genome sequence of Escherichia coli K-12, EcoGene is a source of re-annotated sequences for the SWISS-PROT and Colibri databases. EcoGene is used for genetic and physical map compilations in collaboration with the Coil Genetic Stock Center. The EcoGene12 release includes 4293 genes. EcoGene12 differs from the GenBank annotation of the complete genome sequence in several ways, including (i) the revision of 706 predicted or confirmed gene start sites, (ii) the correction or hypothetical reconstruction of 61 frameshifts caused by either sequence error or mutation, (iii) the reconstruction of 14 protein sequences interrupted by the insertion of IS elements, and (iv) predictions that 92 genes are partially deleted gene fragments. A literature survey identified 717 proteins whose N-terminal amino acids have been verified by sequencing. 12 446 cross-references to 6835 literature citations and abstracts are provided. EcoGene is accessible at a new website: http://bmb.med.miami.edu/EcoGene/EcoWeb. Users can search and retrieve individual EcoGene GenePages or they can download large datasets for incorporation into database management systems, facilitating various genome-scale computational and functional analyses.
引用
收藏
页码:60 / 64
页数:5
相关论文
共 14 条
[1]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[2]   The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :49-54
[3]   Linkage map of Escherichia coli K-12, edition 10: The traditional map [J].
Berlyn, MKB .
MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 1998, 62 (03) :814-+
[4]  
BERLYN MKB, 1996, ESCHERICHIA COLI SAL, V2, P1715
[5]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[6]   NEW GENES IN OLD SEQUENCE - A STRATEGY FOR FINDING GENES IN THE BACTERIAL GENOME [J].
BORODOVSKY, M ;
KOONIN, EV ;
RUDD, KE .
TRENDS IN BIOCHEMICAL SCIENCES, 1994, 19 (08) :309-313
[7]   Eco Cyc:: Encyclopedia of Escherichia coli genes and metabolism [J].
Karp, PD ;
Riley, M ;
Paley, SM ;
Pellegrini-Toole, A ;
Krummenacker, M .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :55-58
[8]   The EcoCyc and MetaCyc databases [J].
Karp, PD ;
Riley, M ;
Saier, M ;
Paulsen, IT ;
Paley, SM ;
Pellegrini-Toole, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :56-59
[9]  
MEDIGUE C, 1993, MICROBIOL REV, V57, P623
[10]   MAPPING SEQUENCED ESCHERICHIA-COLI GENES BY COMPUTER - SOFTWARE, STRATEGIES AND EXAMPLES [J].
RUDD, KE ;
MILLER, W ;
WERNER, C ;
OSTELL, J ;
TOLSTOSHEV, C ;
SATTERFIELD, SG .
NUCLEIC ACIDS RESEARCH, 1991, 19 (03) :637-647