Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome

被引:503
作者
Bickhart, Derek M. [1 ]
Rosen, Benjamin D. [2 ]
Koren, Sergey [3 ]
Sayre, Brian L. [4 ]
Hastie, Alex R. [5 ]
Chan, Saki [5 ]
Lee, Joyce [5 ]
Lam, Ernest T. [5 ]
Liachko, Ivan [6 ]
Sullivan, Shawn T. [7 ]
Burton, Joshua N. [6 ]
Huson, Heather J. [8 ]
Nystrom, John C. [8 ]
Kelley, Christy M. [9 ]
Hutchison, Jana L. [2 ]
Zhou, Yang [2 ,10 ]
Sun, Jiajie [11 ]
Crisa, Alessandra [12 ]
de Leon, F. Abel Ponce [13 ]
Schwartz, John C. [14 ]
Hammond, John A. [14 ]
Waldbieser, Geoffrey C. [15 ]
Schroeder, Steven G. [2 ]
Liu, George E. [2 ]
Dunham, Maitreya J. [6 ]
Shendure, Jay [6 ,16 ]
Sonstegard, Tad S. [17 ]
Phillippy, Adam M. [3 ]
Van Tassell, Curtis P. [2 ]
Smith, Timothy P. L. [9 ]
机构
[1] USDA ARS, Cell Wall Biol & Utilizat Lab, Madison, WI USA
[2] USDA ARS, Anim Genom & Improvement Lab, Beltsville, MD USA
[3] Natl Human Genome Res Inst, Computat & Stat Genom Branch, Genome Informat Sect, Bethesda, MD USA
[4] Virginia State Univ, Dept Biol, Petersburg, VA USA
[5] BioNano Genom, San Diego, CA USA
[6] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA USA
[7] Phase Genom, Seattle, WA USA
[8] Cornell Univ, Dept Anim Sci, Ithaca, NY USA
[9] USDA ARS, Meat Anim Res Ctr, Clay Ctr, NE USA
[10] Northwest A&F Univ, Coll Anim Sci & Technol, Shaanxi Key Lab Agr Mol Biol, Yangling, Peoples R China
[11] China Agr Univ, Guangzhou, Peoples R China
[12] CREA, Anim Prod Res Ctr, Rome, Italy
[13] Univ Minnesota, Dept Anim Sci, St Paul, MN USA
[14] Pirbright Inst, Livestock Viral Dis Programme, Woking, Surrey, England
[15] USDA ARS, Warmwater Aquaculture Res Unit, Stoneville, MS USA
[16] Howard Hughes Med Inst, Seattle, WA USA
[17] Recombinet Inc, St Paul, MN USA
基金
英国生物技术与生命科学研究理事会; 美国食品与农业研究所;
关键词
CHROMOSOMES; ANNOTATION; CATTLE; RECONSTRUCTION; TRANSCRIPTOME; COMPLEXITY; ALIGNMENT; TIME;
D O I
10.1038/ng.3802
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
The decrease in sequencing cost and increased sophistication of assembly algorithms for short-read platforms has resulted in a sharp increase in the number of species with genome assemblies. However, these assemblies are highly fragmented, with many gaps, ambiguities, and errors, impeding downstream applications. We demonstrate current state of the art for de novo assembly using the domestic goat (Capra hircus) based on long reads for contig formation, short reads for consensus validation, and scaffolding by optical and chromatin interaction mapping. These combined technologies produced what is, to our knowledge, the most continuous de novo mammalian assembly to date, with chromosome-length scaffolds and only 649 gaps. Our assembly represents a similar to 400-fold improvement in continuity due to properly assembled gaps, compared to the previously published C. hircus assembly, and better resolves repetitive structures longer than 1 kb, representing the largest repeat family and immune gene complex yet produced for an individual of a ruminant species.
引用
收藏
页码:643 / +
页数:11
相关论文
共 71 条
[1]
[Anonymous], 2016, EVALUATION GRCH38 NO
[2]
The sheep genome reference sequence: a work in progress [J].
Archibald, A. L. ;
Cockett, N. E. ;
Dalrymple, B. P. ;
Faraut, T. ;
Kijas, J. W. ;
Maddox, J. F. ;
McEwan, J. C. ;
Oddy, V. Hutton ;
Raadsma, H. W. ;
Wade, C. ;
Wang, J. ;
Wang, W. ;
Xun, X. .
ANIMAL GENETICS, 2010, 41 (05) :449-453
[3]
Detecting heterozygosity in shotgun genome assemblies: Lessons from obligately outcrossing nematodes [J].
Barriere, Antoine ;
Yang, Shiaw-Pyng ;
Pekarek, Elizabeth ;
Thomas, Cristel G. ;
Haag, Eric S. ;
Ruvinsky, Ilya .
GENOME RESEARCH, 2009, 19 (03) :470-480
[4]
Assembling large genomes with single-molecule sequencing and locality-sensitive hashing [J].
Berlin, Konstantin ;
Koren, Sergey ;
Chin, Chen-Shan ;
Drake, James P. ;
Landolin, Jane M. ;
Phillippy, Adam M. .
NATURE BIOTECHNOLOGY, 2015, 33 (06) :623-+
[5]
SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information [J].
Boetzer, Marten ;
Pirovano, Walter .
BMC BIOINFORMATICS, 2014, 15
[6]
Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[7]
Species-Level Deconvolution of Metagenome Assemblies with Hi-C-Based Contact Probability Maps [J].
Burton, Joshua N. ;
Liachko, Ivan ;
Dunham, Maitreya J. ;
Shendure, Jay .
G3-GENES GENOMES GENETICS, 2014, 4 (07) :1339-1346
[8]
Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions [J].
Burton, Joshua N. ;
Adey, Andrew ;
Patwardhan, Rupali P. ;
Qiu, Ruolan ;
Kitzman, Jacob O. ;
Shendure, Jay .
NATURE BIOTECHNOLOGY, 2013, 31 (12) :1119-+
[9]
APPLICATIONS OF NEXT-GENERATION SEQUENCING Genetic variation and the de novo assembly of human genomes [J].
Chaisson, Mark J. P. ;
Wilson, Richard K. ;
Eichler, Evan E. .
NATURE REVIEWS GENETICS, 2015, 16 (11) :627-640
[10]
Chin CS, 2016, NAT METHODS, V13, P1050, DOI [10.1038/NMETH.4035, 10.1038/nmeth.4035]