A Primer on Metagenomics

被引:392
作者
Wooley, John C. [2 ]
Godzik, Adam [2 ,3 ]
Friedberg, Iddo [1 ,4 ]
机构
[1] Miami Univ, Dept Comp Sci & Software Engn, Oxford, OH 45056 USA
[2] Univ Calif San Diego, Community Cyberinfrastruct Marine Microbial Ecol, Calif Inst Telecommun & Informat Technol, La Jolla, CA 92093 USA
[3] Burnham Inst Med Res, Program Bioinformat & Syst Biol, La Jolla, CA USA
[4] Miami Univ, Dept Microbiol, Oxford, OH 45056 USA
关键词
16S RIBOSOMAL-RNA; PHYLOGENETIC CLASSIFICATION; FUNCTIONAL DIVERSITY; MICROBIAL DIVERSITY; NUCLEOTIDE-SEQUENCE; MINIMUM INFORMATION; VIRAL COMMUNITIES; MARINE ECOSYSTEMS; DATA-MANAGEMENT; DNA-SEQUENCES;
D O I
10.1371/journal.pcbi.1000667
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Metagenomics is a discipline that enables the genomic study of uncultured microorganisms. Faster, cheaper sequencing technologies and the ability to sequence uncultured microbes sampled directly from their habitats are expanding and transforming our view of the microbial world. Distilling meaningful information from the millions of new genomic sequences presents a serious challenge to bioinformaticians. In cultured microbes, the genomic data come from a single clone, making sequence assembly and annotation tractable. In metagenomics, the data come from heterogeneous microbial communities, sometimes containing more than 10,000 species, with the sequence data being noisy and partial. From sampling, to assembly, to gene calling and function prediction, bioinformatics faces new demands in interpreting voluminous, noisy, and often partial sequence data. Although metagenomics is a relative newcomer to science, the past few years have seen an explosion in computational methods applied to metagenomic-based research. It is therefore not within the scope of this article to provide an exhaustive review. Rather, we provide here a concise yet comprehensive introduction to the current computational requirements presented by metagenomics, and review the recent progress made. We also note whether there is software that implements any of the methods presented here, and briefly review its utility. Nevertheless, it would be useful if readers of this article would avail themselves of the comment section provided by this journal, and relate their own experiences. Finally, the last section of this article provides a few representative studies illustrating different facets of recent scientific discoveries made using metagenomics.
引用
收藏
页数:13
相关论文
共 134 条
  • [1] Photosynthetic and phylogenetic primers for detection of anoxygenic phototrophs in natural environments
    Achenbach, LA
    Carey, J
    Madigan, MT
    [J]. APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2001, 67 (07) : 2922 - 2926
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [4] PHYLOGENETIC IDENTIFICATION AND IN-SITU DETECTION OF INDIVIDUAL MICROBIAL-CELLS WITHOUT CULTIVATION
    AMANN, RI
    LUDWIG, W
    SCHLEIFER, KH
    [J]. MICROBIOLOGICAL REVIEWS, 1995, 59 (01) : 143 - 169
  • [5] PHACCS, an online tool for estimating the structure and diversity of uncultured viral communities using metagenomic information
    Angly, F
    Rodriguez-Brito, B
    Bangor, D
    McNairnie, P
    Breitbart, M
    Salamon, P
    Felts, B
    Nulton, J
    Mahaffy, J
    Rohwer, F
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [6] Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes
    Aparicio, S
    Chapman, J
    Stupka, E
    Putnam, N
    Chia, J
    Dehal, P
    Christoffels, A
    Rash, S
    Hoon, S
    Smit, A
    Gelpke, MDS
    Roach, J
    Oh, T
    Ho, IY
    Wong, M
    Detter, C
    Verhoef, F
    Predki, P
    Tay, A
    Lucas, S
    Richardson, P
    Smith, SF
    Clark, MS
    Edwards, YJK
    Doggett, N
    Zharkikh, A
    Tavtigian, SV
    Pruss, D
    Barnstead, M
    Evans, C
    Baden, H
    Powell, J
    Glusman, G
    Rowen, L
    Hood, L
    Tan, YH
    Elgar, G
    Hawkins, T
    Venkatesh, B
    Rokhsar, D
    Brenner, S
    [J]. SCIENCE, 2002, 297 (5585) : 1301 - 1310
  • [7] Probabilistic methods of identifying genes in prokaryotic genomes: Connections to the FIMM theory
    Azad, RK
    Borodovsky, M
    [J]. BRIEFINGS IN BIOINFORMATICS, 2004, 5 (02) : 118 - 130
  • [8] Soil eukaryotic functional diversity, a metatranscriptomic approach
    Bailly, Julie
    Fraissinet-Tachet, Laurence
    Verner, Marie-Christine
    Debaud, Jean-Claude
    Lemaire, Marc
    Wesolowski-Louvel, Micheline
    Marmeisse, Roland
    [J]. ISME JOURNAL, 2007, 1 (07) : 632 - 642
  • [9] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [10] Genome sequence data: management, storage, and visualization
    Batley, Jacqueline
    Edwards, David
    [J]. BIOTECHNIQUES, 2009, 46 (05) : 333 - +