Species-Level Deconvolution of Metagenome Assemblies with Hi-C-Based Contact Probability Maps

Times cited: 129
Authors
Burton, Joshua N. [1]
Liachko, Ivan [1]
Dunham, Maitreya J. [1]
Shendure, Jay [1]
Affiliations
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
Source
G3-GENES GENOMES GENETICS | 2014 / Vol. 4 / No. 07
Funding
National Science Foundation (USA);
Keywords
Hi-C; metagenome assembly; metagenomics; clustering algorithms; GENE-EXPRESSION; GENOME SEQUENCE; REVEALS; ORGANIZATION; DIVERSITY; ALIGNMENT;
DOI
10.1534/g3.114.011825
Chinese Library Classification number
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing and de novo assembly of a metagenome typically yield a collection of contigs that cannot readily be grouped by species. Methods for generating chromatin-level contact probability maps, e.g., as generated by the Hi-C method, provide a signal of contiguity that is completely intracellular and contains both intrachromosomal and interchromosomal information. Here, we demonstrate how this signal can be exploited to reconstruct the individual genomes of microbial species present within a mixed sample. We apply this approach to two synthetic metagenome samples, successfully clustering the genome content of fungal, bacterial, and archaeal species with more than 99% agreement with published reference genomes. We also show that the Hi-C signal can secondarily be used to create scaffolded genome assemblies of individual eukaryotic species present within the microbial community, with higher levels of contiguity than some of the species' published reference genomes.
Pages: 1339-1346
Page count: 8