SEED Servers: High-Performance Access to the SEED Genomes, Annotations, and Metabolic Models

被引:156
作者
Aziz, Ramy K. [1 ,3 ,4 ]
Devoid, Scott [1 ,2 ]
Disz, Terrence [1 ,2 ]
Edwards, Robert A. [2 ,3 ]
Henry, Christopher S. [1 ,2 ]
Olsen, Gary J. [5 ]
Olson, Robert [1 ,2 ]
Overbeek, Ross
Parrello, Bruce
Pusch, Gordon D.
Stevens, Rick L. [1 ]
Vonstein, Veronika
Xia, Fangfang [1 ,2 ]
机构
[1] Univ Chicago, Computat Inst, Chicago, IL 60637 USA
[2] Argonne Natl Lab, Math & Comp Sci Div, Argonne, IL 60439 USA
[3] San Diego State Univ, Dept Comp Sci, San Diego, CA 92182 USA
[4] Cairo Univ, Fac Pharm, Dept Microbiol & Immunol, Cairo, Egypt
[5] Univ Illinois, Dept Microbiol, Urbana, IL USA
来源
PLOS ONE | 2012年 / 7卷 / 10期
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
DATABASES; CONSERVATION; ORGANIZATION; GENERATION; EVOLUTION; BACTERIAL; ORIGINS; UNIPROT;
D O I
10.1371/journal.pone.0048053
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The remarkable advance in sequencing technology and the rising interest in medical and environmental microbiology, biotechnology, and synthetic biology resulted in a deluge of published microbial genomes. Yet, genome annotation, comparison, and modeling remain a major bottleneck to the translation of sequence information into biological knowledge, hence computational analysis tools are continuously being developed for rapid genome annotation and interpretation. Among the earliest, most comprehensive resources for prokaryotic genome analysis, the SEED project, initiated in 2003 as an integration of genomic data and analysis tools, now contains >5,000 complete genomes, a constantly updated set of curated annotations embodied in a large and growing collection of encoded subsystems, a derived set of protein families, and hundreds of genome-scale metabolic models. Until recently, however, maintaining current copies of the SEED code and data at remote locations has been a pressing issue. To allow high-performance remote access to the SEED database, we developed the SEED Servers (http://www.theseed.org/servers): four network-based servers intended to expose the data in the underlying relational database, support basic annotation services, offer programmatic access to the capabilities of the RAST annotation server, and provide access to a growing collection of metabolic models that support flux balance analysis. The SEED servers offer open access to regularly updated data, the ability to annotate prokaryotic genomes, the ability to create metabolic reconstructions and detailed models of metabolism, and access to hundreds of existing metabolic models. This work offers and supports a framework upon which other groups can build independent research efforts. Large integrations of genomic data represent one of the major intellectual resources driving research in biology, and programmatic access to the SEED data will provide significant utility to a broad collection of potential users.
引用
收藏
页数:10
相关论文
共 43 条
[1]   A Flood of Microbial Genomes-Do We Need More? [J].
Ahmed, Niyaz .
PLOS ONE, 2009, 4 (06)
[2]   PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies [J].
Akhter, Sajia ;
Aziz, Ramy K. ;
Edwards, Robert A. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (16) :e126
[3]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   The RAST server: Rapid annotations using subsystems technology [J].
Aziz, Ramy K. ;
Bartels, Daniela ;
Best, Aaron A. ;
DeJongh, Matthew ;
Disz, Terrence ;
Edwards, Robert A. ;
Formsma, Kevin ;
Gerdes, Svetlana ;
Glass, Elizabeth M. ;
Kubal, Michael ;
Meyer, Folker ;
Olsen, Gary J. ;
Olson, Robert ;
Osterman, Andrei L. ;
Overbeek, Ross A. ;
McNeil, Leslie K. ;
Paarmann, Daniel ;
Paczian, Tobias ;
Parrello, Bruce ;
Pusch, Gordon D. ;
Reich, Claudia ;
Stevens, Rick ;
Vassieva, Olga ;
Vonstein, Veronika ;
Wilke, Andreas ;
Zagnitko, Olga .
BMC GENOMICS, 2008, 9 (1)
[6]   Pathogen Microevolution in High Resolution [J].
Aziz, Ramy K. ;
Nizet, Victor .
SCIENCE TRANSLATIONAL MEDICINE, 2010, 2 (16)
[7]   BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata [J].
Barrett, Tanya ;
Clark, Karen ;
Gevorgyan, Robert ;
Gorelenkov, Vyacheslav ;
Gribov, Eugene ;
Karsch-Mizrachi, Ilene ;
Kimelman, Michael ;
Pruitt, Kim D. ;
Resenchuk, Sergei ;
Tatusova, Tatiana ;
Yaschenko, Eugene ;
Ostell, James .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D57-D63
[8]  
Battegay M, 2020, SWISS MED WKLY, V150, DOI [10.4414/smw.w20203, 10.4414/smw.2020.20203]
[9]   Sequencing the species pan-genome [J].
Bentley, Stephen .
NATURE REVIEWS MICROBIOLOGY, 2009, 7 (04) :258-259
[10]   Genome sequence analyses of two isolates from the recent Escherichia coli outbreak in Germany reveal the emergence of a new pathotype: Entero-Aggregative-Haemorrhagic Escherichia coli (EAHEC) [J].
Brzuszkiewicz, Elzbieta ;
Thuermer, Andrea ;
Schuldes, Joerg ;
Leimbach, Andreas ;
Liesegang, Heiko ;
Meyer, Frauke-Dorothee ;
Boelter, Juergen ;
Petersen, Heiko ;
Gottschalk, Gerhard ;
Daniel, Rolf .
ARCHIVES OF MICROBIOLOGY, 2011, 193 (12) :883-891