A Primer on Metagenomics

Cited by: 392
Authors
Wooley, John C. [2]
Godzik, Adam [2, 3]
Friedberg, Iddo [1, 4]
Affiliations
[1] Miami Univ, Dept Comp Sci & Software Engn, Oxford, OH 45056 USA
[2] Univ Calif San Diego, Community Cyberinfrastruct Marine Microbial Ecol, Calif Inst Telecommun & Informat Technol, La Jolla, CA 92093 USA
[3] Burnham Inst Med Res, Program Bioinformat & Syst Biol, La Jolla, CA USA
[4] Miami Univ, Dept Microbiol, Oxford, OH 45056 USA
Keywords
16S RIBOSOMAL-RNA; PHYLOGENETIC CLASSIFICATION; FUNCTIONAL DIVERSITY; MICROBIAL DIVERSITY; NUCLEOTIDE-SEQUENCE; MINIMUM INFORMATION; VIRAL COMMUNITIES; MARINE ECOSYSTEMS; DATA-MANAGEMENT; DNA-SEQUENCES
DOI
10.1371/journal.pcbi.1000667
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Metagenomics is a discipline that enables the genomic study of uncultured microorganisms. Faster, cheaper sequencing technologies and the ability to sequence uncultured microbes sampled directly from their habitats are expanding and transforming our view of the microbial world. Distilling meaningful information from the millions of new genomic sequences presents a serious challenge to bioinformaticians. In cultured microbes, the genomic data come from a single clone, making sequence assembly and annotation tractable. In metagenomics, the data come from heterogeneous microbial communities, sometimes containing more than 10,000 species, with the sequence data being noisy and partial. From sampling, to assembly, to gene calling and function prediction, bioinformatics faces new demands in interpreting voluminous, noisy, and often partial sequence data. Although metagenomics is a relative newcomer to science, the past few years have seen an explosion in computational methods applied to metagenomic-based research. It is therefore not within the scope of this article to provide an exhaustive review. Rather, we provide here a concise yet comprehensive introduction to the current computational requirements presented by metagenomics, and review the recent progress made. We also note whether there is software that implements any of the methods presented here, and briefly review its utility. Nevertheless, it would be useful if readers of this article would avail themselves of the comment section provided by this journal, and relate their own experiences. Finally, the last section of this article provides a few representative studies illustrating different facets of recent scientific discoveries made using metagenomics.
Pages: 13