Reconstructing the Genomic Content of Microbiome Taxa through Shotgun Metagenomic Deconvolution

Cited by: 35
Authors
Carr, Rogan [1]
Shen-Orr, Shai S. [2, 3]
Borenstein, Elhanan [1, 4, 5]
Affiliations
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Technion Israel Inst Technol, Fac Med, Rappaport Inst Med Res, Dept Immunol, Haifa, Israel
[3] Technion Israel Inst Technol, Fac Biol, Haifa, Israel
[4] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[5] Santa Fe Inst, Santa Fe, NM 87501 USA
Funding
National Institutes of Health (USA);
Keywords
GUT MICROBIOME; ALGORITHM; SEQUENCES; DIVERSITY; EVOLUTION; DATABASE; CATALOG; READS; AGE;
DOI
10.1371/journal.pcbi.1003292
Chinese Library Classification
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Metagenomics has transformed our understanding of the microbial world, allowing researchers to bypass the need to isolate and culture individual taxa and to directly characterize both the taxonomic and gene compositions of environmental samples. However, associating the genes found in a metagenomic sample with the specific taxa of origin remains a critical challenge. Existing binning methods, based on nucleotide composition or alignment to reference genomes, allow only a coarse-grained classification and rely heavily on the availability of sequenced genomes from closely related taxa. Here, we introduce a novel computational framework, integrating variation in gene abundances across multiple samples with taxonomic abundance data to deconvolve metagenomic samples into taxa-specific gene profiles and to reconstruct the genomic content of community members. This assembly-free method is not bounded by various factors limiting previously described methods of metagenomic binning or metagenomic assembly and represents a fundamentally different approach to metagenomic-based genome reconstruction. An implementation of this framework is available at http://elbo.gs.washington.edu/software.html. We first describe the mathematical foundations of our framework and discuss considerations for implementing its various components. We demonstrate the ability of this framework to accurately deconvolve a set of metagenomic samples and to recover the gene content of individual taxa using synthetic metagenomic samples. We specifically characterize determinants of prediction accuracy and examine the impact of annotation errors on the reconstructed genomes. We finally apply metagenomic deconvolution to samples from the Human Microbiome Project, successfully reconstructing genus-level genomic content of various microbial genera, based solely on variation in gene count. These reconstructed genera are shown to correctly capture genus-specific properties. With the accumulation of metagenomic data, this deconvolution framework provides an essential tool for characterizing microbial taxa never before seen, laying the foundation for addressing fundamental questions concerning the taxa comprising diverse microbial communities.
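The core idea in the abstract, that gene abundances measured across many samples can be combined with taxonomic abundances to recover taxa-specific gene profiles, can be illustrated with a minimal sketch. It assumes a simple linear mixing model in which each sample's gene abundance is the taxon-abundance-weighted sum of per-taxon gene copy numbers, and it solves that model with ordinary least squares. The model form, the function name `deconvolve`, the variable names, and the synthetic data are all assumptions made for illustration; this is not the authors' released implementation, and the abstract does not state which solver they use.

```python
import numpy as np

def deconvolve(taxa_abundance, gene_abundance):
    """Estimate per-taxon gene content under an assumed linear mixing model.

    taxa_abundance: array of shape (n_samples, n_taxa)
    gene_abundance: array of shape (n_samples, n_genes)
    Returns an array of shape (n_taxa, n_genes) with the least-squares
    estimate of each gene's copy number in each taxon.
    """
    A = np.asarray(taxa_abundance, dtype=float)
    E = np.asarray(gene_abundance, dtype=float)
    # Solve A @ G ~= E for G, one column (gene) at a time, in a single call.
    G, *_ = np.linalg.lstsq(A, E, rcond=None)
    return G

# Synthetic, noise-free example (all values are made up for illustration).
rng = np.random.default_rng(0)
n_samples, n_taxa, n_genes = 20, 3, 5
true_G = rng.integers(0, 3, size=(n_taxa, n_genes)).astype(float)  # copy numbers
A = rng.dirichlet(np.ones(n_taxa), size=n_samples)  # relative taxon abundances
E = A @ true_G                                      # mixed gene abundances
est_G = deconvolve(A, E)
```

With more samples than taxa and taxon abundances that vary between samples, the system is overdetermined and the noise-free profiles are recovered up to numerical precision. Real data would include measurement noise and annotation errors, so a constrained or regularized solver (for example, non-negative least squares) might be preferable in practice.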
Pages: 15