A Bioinformatician's Guide to Metagenomics

Times cited: 260
Authors
Kunin, Victor [1]
Copeland, Alex [2]
Lapidus, Alla [3]
Mavromatis, Konstantinos [4]
Hugenholtz, Philip [1]
Affiliations
[1] DOE Joint Genome Inst, Microbial Ecol Program, Walnut Creek, CA USA
[2] DOE Joint Genome Inst, Qual Assurance Dept, Walnut Creek, CA USA
[3] DOE Joint Genome Inst, Microbial Genom Dept, Walnut Creek, CA USA
[4] DOE Joint Genome Inst, Genome Biol Program, Walnut Creek, CA USA
DOI
10.1128/MMBR.00009-08
Chinese Library Classification (CLC) number
Q93 [Microbiology];
Discipline classification codes
071005 ; 100705 ;
Abstract
As random shotgun metagenomic projects proliferate and become the dominant source of publicly available sequence data, procedures for the best practices in their execution and analysis become increasingly important. Based on our experience at the Joint Genome Institute, we describe the chain of decisions accompanying a metagenomic project from the viewpoint of the bioinformatic analysis step by step. We guide the reader through a standard workflow for a metagenomic project beginning with presequencing considerations such as community composition and sequence data type that will greatly influence downstream analyses. We proceed with recommendations for sampling and data generation including sample and metadata collection, community profiling, construction of shotgun libraries, and sequencing strategies. We then discuss the application of generic sequence processing steps (read preprocessing, assembly, and gene prediction and annotation) to metagenomic data sets in contrast to genome projects. Different types of data analyses particular to metagenomes are then presented, including binning, dominant population analysis, and gene-centric analysis. Finally, data management issues are presented and discussed. We hope that this review will assist bioinformaticians and biologists in making better-informed decisions on their journey during a metagenomic project.
Pages: 557-578
Page count: 22
Related papers
152 in total
[1]   Informatics for unveiling hidden genome signatures [J].
Abe, T ;
Kanaya, S ;
Kinouchi, M ;
Ichiba, Y ;
Kozuki, T ;
Ikemura, T .
GENOME RESEARCH, 2003, 13 (04) :693-702
[2]   Novel phylogenetic studies of genomic sequence fragments derived from uncultured microbe mixtures in environmental and clinical samples [J].
Abe, Takashi ;
Sugawara, Hideaki ;
Kinouchi, Makoto ;
Kanaya, Shigehiko ;
Ikemura, Toshimichi .
DNA RESEARCH, 2005, 12 (05) :281-290
[3]   Microbial diversity and the genetic nature of microbial species [J].
Achtman, Mark ;
Wagner, Michael .
NATURE REVIEWS MICROBIOLOGY, 2008, 6 (06) :431-440
[4]   Community genomics in microbial ecology and evolution [J].
Allen, EE ;
Banfield, JF .
NATURE REVIEWS MICROBIOLOGY, 2005, 3 (06) :489-498
[5]   Genome dynamics in a natural archaeal population [J].
Allen, Eric E. ;
Tyson, Gene W. ;
Whitaker, Rachel J. ;
Detter, John C. ;
Richardson, Paul M. ;
Banfield, Jillian F. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (06) :1883-1888
[6]   The MicrobesOnline web site for comparative genomics [J].
Alm, EJ ;
Huang, KH ;
Price, MN ;
Koche, RP ;
Keller, K ;
Dubchak, IL ;
Arkin, AP .
GENOME RESEARCH, 2005, 15 (07) :1015-1022
[7]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[8]   The identification of microorganisms by fluorescence in situ hybridisation [J].
Amann, R ;
Fuchs, BM ;
Behrens, S .
CURRENT OPINION IN BIOTECHNOLOGY, 2001, 12 (03) :231-236
[9]   The marine viromes of four oceanic regions [J].
Angly, Florent E. ;
Felts, Ben ;
Breitbart, Mya ;
Salamon, Peter ;
Edwards, Robert A. ;
Carlson, Craig ;
Chan, Amy M. ;
Haynes, Matthew ;
Kelley, Scott ;
Liu, Hong ;
Mahaffy, Joseph M. ;
Mueller, Jennifer E. ;
Nulton, Jim ;
Olson, Robert ;
Parsons, Rachel ;
Rayhawk, Steve ;
Suttle, Curtis A. ;
Rohwer, Forest .
PLOS BIOLOGY, 2006, 4 (11) :2121-2131
[10]   Evaluation of Phi29-based whole-genome amplification for microarray-based comparative genomic hybridisation [J].
Arriola, Edurne ;
Lambros, Maryou B. K. ;
Jones, Chris ;
Dexter, Tim ;
Mackay, Alan ;
Tan, David S. P. ;
Tamber, Narinder ;
Fenwick, Kerry ;
Ashworth, Alan ;
Dowsett, Mitch ;
Reis-Filho, Jorge S. .
LABORATORY INVESTIGATION, 2007, 87 (01) :75-83