Multidimensional annotation of the Escherichia coli K-12 genome

Times cited: 127
Authors
Karp, Peter D. [1]
Keseler, Ingrid M. [1]
Shearer, Alexander [1]
Latendresse, Mario [1]
Krummenacker, Markus [1]
Paley, Suzanne M. [1]
Paulsen, Ian [2, 3]
Collado-Vides, Julio [4]
Gama-Castro, Socorro [4]
Peralta-Gil, Martin [4]
Santos-Zavaleta, Alberto [4]
Penaloza-Spinola, Monica I. [4]
Bonavides-Martinez, Cesar [4]
Ingraham, John [5]
Affiliations
[1] SRI Int, Menlo Pk, CA 94025 USA
[2] J Craig Venter Inst, Rockville, MD 20850 USA
[3] Macquarie Univ, Dept Chem & Biomol Sci, Sydney, NSW 2109, Australia
[4] Univ Nacl Autonoma Mexico, Ctr Ciencias Genom, Mexico City 04510, DF, Mexico
[5] Univ Calif Davis, Davis, CA USA
DOI
10.1093/nar/gkm740
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The annotation of the Escherichia coli K-12 genome in the EcoCyc database is one of the most accurate, complete and multidimensional genome annotations. Of the 4460 E. coli genes, EcoCyc assigns biochemical functions to 76%, and 66% of all genes had their functions determined experimentally. EcoCyc assigns E. coli genes to Gene Ontology and to MultiFun. Seventy-five percent of gene products contain reviews authored by the EcoCyc project that summarize the experimental literature about the gene product. EcoCyc information was derived from 15 000 publications. The database contains extensive descriptions of E. coli cellular networks, describing its metabolic, transport and transcriptional regulatory processes. A comparison to genome annotations for other model organisms shows that the E. coli genome contains the most experimentally determined gene functions in both relative and absolute terms: 2941 (66%) for E. coli, 2319 (37%) for Saccharomyces cerevisiae, 1816 (5%) for Arabidopsis thaliana, 1456 (4%) for Mus musculus and 614 (4%) for Drosophila melanogaster. Database queries to EcoCyc survey the global properties of E. coli cellular networks and illuminate the extent of information gaps for E. coli, such as dead-end metabolites. EcoCyc provides a genome browser with novel properties, and a novel interactive display of transcriptional regulatory networks.
Pages: 7577-7590
Page count: 14