A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea

Cited by: 696
Authors
Wu, Dongying [1, 2]
Hugenholtz, Philip [1 ]
Mavromatis, Konstantinos [1 ]
Pukall, Ruediger [3 ]
Dalin, Eileen [1 ]
Ivanova, Natalia N. [1 ]
Kunin, Victor [1 ]
Goodwin, Lynne [4 ]
Wu, Martin [6 ]
Tindall, Brian J. [3 ]
Hooper, Sean D. [1 ]
Pati, Amrita [1 ]
Lykidis, Athanasios [1 ]
Spring, Stefan [3 ]
Anderson, Iain J. [1 ]
D'haeseleer, Patrik [1, 5]
Zemla, Adam [5 ]
Singer, Mitchell [2 ]
Lapidus, Alla [1 ]
Nolan, Matt [1 ]
Copeland, Alex [1 ]
Han, Cliff [4 ]
Chen, Feng [1 ]
Cheng, Jan-Fang [1 ]
Lucas, Susan [1 ]
Kerfeld, Cheryl [1 ]
Lang, Elke [3 ]
Gronow, Sabine [3 ]
Chain, Patrick [1, 4]
Bruce, David [4 ]
Rubin, Edward M. [1 ]
Kyrpides, Nikos C. [1 ]
Klenk, Hans-Peter [3 ]
Eisen, Jonathan A. [1, 2]
Affiliations
[1] DOE Joint Genome Inst, Walnut Creek, CA 94598 USA
[2] Univ Calif Davis, Davis, CA 95616 USA
[3] Deutsch Sammlung Mikroorganism Zellkultur GmbH, German Collect Microorganisms & Cell Cultures, D-38124 Braunschweig, Germany
[4] Los Alamos Natl Lab, DOE Joint Genome Inst, Los Alamos, NM 87545 USA
[5] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
[6] Univ Virginia, Charlottesville, VA 22904 USA
Keywords
MICROBIAL DIVERSITY; PROTEIN FAMILIES; DATABASE GOLD; PROJECTS; SEQUENCE
DOI
10.1038/nature08656
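Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract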
Sequencing of bacterial and archaeal genomes has revolutionized our understanding of the many roles played by microorganisms (1). There are now nearly 1,000 completed bacterial and archaeal genomes available (2), most of which were chosen for sequencing on the basis of their physiology. As a result, the perspective provided by the currently available genomes is limited by a highly biased phylogenetic distribution (3-5). To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, we have sequenced and analysed the genomes of 56 culturable species of Bacteria and Archaea selected to maximize phylogenetic coverage. Analysis of these genomes demonstrated pronounced benefits (compared to an equivalent set of genomes randomly selected from the existing database) in diverse areas including the reconstruction of phylogenetic history, the discovery of new protein families and biological properties, and the prediction of functions for known genes from other organisms. Our results strongly support the need for systematic 'phylogenomic' efforts to compile a phylogeny-driven 'Genomic Encyclopedia of Bacteria and Archaea' in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.
Pages: 1056-1060
Number of pages: 5
Related papers
30 entries in total
[1]   Microbial diversity and the genetic nature of microbial species [J].
Achtman, Mark ;
Wagner, Michael .
NATURE REVIEWS MICROBIOLOGY, 2008, 6 (06) :431-440
[2]   CRISPR provides acquired resistance against viruses in prokaryotes [J].
Barrangou, Rodolphe ;
Fremaux, Christophe ;
Deveau, Helene ;
Richards, Melissa ;
Boyaval, Patrick ;
Moineau, Sylvain ;
Romero, Dennis A. ;
Horvath, Philippe .
SCIENCE, 2007, 315 (5819) :1709-1712
[3]   The Impact of Reticulate Evolution on Genome Phylogeny [J].
Beiko, Robert G. ;
Doolittle, W. Ford ;
Charlebois, Robert L. .
SYSTEMATIC BIOLOGY, 2008, 57 (06) :844-856
[4]   Genomes OnLine Database (GOLD): a monitor of genome projects world-wide [J].
Bernal, A ;
Ear, U ;
Kyrpides, N .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :126-127
[5]   Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB [J].
DeSantis, T. Z. ;
Hugenholtz, P. ;
Larsen, N. ;
Rojas, M. ;
Brodie, E. L. ;
Keller, K. ;
Huber, T. ;
Dalevi, D. ;
Hu, P. ;
Andersen, G. L. .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2006, 72 (07) :5069-5072
[6]   Bacterial actins? An evolutionary perspective [J].
Doolittle, RF ;
York, AL .
BIOESSAYS, 2002, 24 (04) :293-296
[7]   Assessing evolutionary relationships among microbes from whole-genome analysis [J].
Eisen, JA .
CURRENT OPINION IN MICROBIOLOGY, 2000, 3 (05) :475-480
[8]   An efficient algorithm for large-scale detection of protein families [J].
Enright, AJ ;
Van Dongen, S ;
Ouzounis, CA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (07) :1575-1584
[9]   Protein interaction maps for complete genomes based on gene fusion events [J].
Enright, AJ ;
Iliopoulos, I ;
Kyrpides, NC ;
Ouzounis, CA .
NATURE, 1999, 402 (6757) :86-90
[10]   The minimum information about a genome sequence (MIGS) specification [J].
Field, Dawn ;
Garrity, George ;
Gray, Tanya ;
Morrison, Norman ;
Selengut, Jeremy ;
Sterk, Peter ;
Tatusova, Tatiana ;
Thomson, Nicholas ;
Allen, Michael J. ;
Angiuoli, Samuel V. ;
Ashburner, Michael ;
Axelrod, Nelson ;
Baldauf, Sandra ;
Ballard, Stuart ;
Boore, Jeffrey ;
Cochrane, Guy ;
Cole, James ;
Dawyndt, Peter ;
De Vos, Paul ;
dePamphilis, Claude ;
Edwards, Robert ;
Faruque, Nadeem ;
Feldman, Robert ;
Gilbert, Jack ;
Gilna, Paul ;
Gloeckner, Frank Oliver ;
Goldstein, Philip ;
Guralnick, Robert ;
Haft, Dan ;
Hancock, David ;
Hermjakob, Henning ;
Hertz-Fowler, Christiane ;
Hugenholtz, Phil ;
Joint, Ian ;
Kagan, Leonid ;
Kane, Matthew ;
Kennedy, Jessie ;
Kowalchuk, George ;
Kottmann, Renzo ;
Kolker, Eugene ;
Kravitz, Saul ;
Kyrpides, Nikos ;
Leebens-Mack, Jim ;
Lewis, Suzanna E. ;
Li, Kelvin ;
Lister, Allyson L. ;
Lord, Phillip ;
Maltsev, Natalia ;
Markowitz, Victor ;
Martiny, Jennifer .
NATURE BIOTECHNOLOGY, 2008, 26 (05) :541-547