Haplotype-resolved genome sequencing of a Gujarati Indian individual

被引:168
作者
Kitzman, Jacob O. [1 ]
MacKenzie, Alexandra P. [1 ]
Adey, Andrew [1 ]
Hiatt, Joseph B. [1 ]
Patwardhan, Rupali P. [1 ]
Sudmant, Peter H. [1 ]
Ng, Sarah B. [1 ]
Alkan, Can [1 ,2 ]
Qiu, Ruolan [1 ]
Eichler, Evan E. [1 ,2 ]
Shendure, Jay [1 ]
机构
[1] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[2] Howard Hughes Med Inst, Seattle, WA USA
基金
美国国家卫生研究院; 美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
STRUCTURAL VARIATION; COPY-NUMBER; SHORT-READ; ALGORITHMS;
D O I
10.1038/nbt.1740
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Haplotype information is essential to the complete description and interpretation of genomes(1), genetic diversity(2) and genetic ancestry(3). Although individual human genome sequencing is increasingly routine(4), nearly all such genomes are unresolved with respect to haplotype. Here we combine the throughput of massively parallel sequencing(5) with the contiguity information provided by large-insert cloning(6) to experimentally determine the haplotype-resolved genome of a South Asian individual. A single fosmid library was split into a modest number of pools, each providing similar to 3% physical coverage of the diploid genome. Sequencing of each pool yielded reads overwhelmingly derived from only one homologous chromosome at any given location. These data were combined with whole-genome shotgun sequence to directly phase 94% of ascertained heterozygous single nucleotide polymorphisms (SNPs) into long haplotype blocks (N50 of 386 kilobases (kbp)). This method also facilitates the analysis of structural variation, for example, to anchor novel insertions(7,8) to specific locations and haplotypes.
引用
收藏
页码:59 / +
页数:6
相关论文
共 33 条
[21]   Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding [J].
McKernan, Kevin Judd ;
Peckham, Heather E. ;
Costa, Gina L. ;
McLaughlin, Stephen F. ;
Fu, Yutao ;
Tsung, Eric F. ;
Clouser, Christopher R. ;
Duncan, Cisyla ;
Ichikawa, Jeffrey K. ;
Lee, Clarence C. ;
Zhang, Zheng ;
Ranade, Swati S. ;
Dimalanta, Eileen T. ;
Hyland, Fiona C. ;
Sokolsky, Tanya D. ;
Zhang, Lei ;
Sheridan, Andrew ;
Fu, Haoning ;
Hendrickson, Cynthia L. ;
Li, Bin ;
Kotler, Lev ;
Stuart, Jeremy R. ;
Malek, Joel A. ;
Manning, Jonathan M. ;
Antipova, Alena A. ;
Perez, Damon S. ;
Moore, Michael P. ;
Hayashibara, Kathleen C. ;
Lyons, Michael R. ;
Beaudoin, Robert E. ;
Coleman, Brittany E. ;
Laptewicz, Michael W. ;
Sannicandro, Adam E. ;
Rhodes, Michael D. ;
Gottimukkala, Rajesh K. ;
Yang, Shan ;
Bafna, Vineet ;
Bashir, Ali ;
MacBride, Andrew ;
Alkan, Can ;
Kidd, Jeffrey M. ;
Eichler, Evan E. ;
Reese, Martin G. ;
De la Vega, Francisco M. ;
Blanchard, Alan P. .
GENOME RESEARCH, 2009, 19 (09) :1527-1541
[22]   Exome sequencing identifies the cause of a mendelian disorder [J].
Ng, Sarah B. ;
Buckingham, Kati J. ;
Lee, Choli ;
Bigham, Abigail W. ;
Tabor, Holly K. ;
Dent, Karin M. ;
Huff, Chad D. ;
Shannon, Paul T. ;
Jabs, Ethylin Wang ;
Nickerson, Deborah A. ;
Shendure, Jay ;
Bamshad, Michael J. .
NATURE GENETICS, 2010, 42 (01) :30-U41
[23]   Targeted capture and massively parallel sequencing of 12 human exomes [J].
Ng, Sarah B. ;
Turner, Emily H. ;
Robertson, Peggy D. ;
Flygare, Steven D. ;
Bigham, Abigail W. ;
Lee, Choli ;
Shaffer, Tristan ;
Wong, Michelle ;
Bhattacharjee, Arindam ;
Eichler, Evan E. ;
Bamshad, Michael ;
Nickerson, Deborah A. ;
Shendure, Jay .
NATURE, 2009, 461 (7261) :272-U153
[24]   Targeted, haplotype-resolved resequencing of long segments of the human genome [J].
Raymond, CK ;
Subramanian, S ;
Paddock, M ;
Qiu, RL ;
Deodato, C ;
Palmieri, A ;
Chang, J ;
Radke, T ;
Haugen, E ;
Kas, A ;
Waring, D ;
Bovee, D ;
Stacy, R ;
Kaul, R ;
Olson, MV .
GENOMICS, 2005, 86 (06) :759-766
[25]   Reconstructing Indian population history [J].
Reich, David ;
Thangaraj, Kumarasamy ;
Patterson, Nick ;
Price, Alkes L. ;
Singh, Lalji .
NATURE, 2009, 461 (7263) :489-U50
[26]   Analysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing [J].
Roach, Jared C. ;
Glusman, Gustavo ;
Smit, Arian F. A. ;
Huff, Chad D. ;
Hubley, Robert ;
Shannon, Paul T. ;
Rowen, Lee ;
Pant, Krishna P. ;
Goodman, Nathan ;
Bamshad, Michael ;
Shendure, Jay ;
Drmanac, Radoje ;
Jorde, Lynn B. ;
Hood, Leroy ;
Galas, David J. .
SCIENCE, 2010, 328 (5978) :636-639
[27]   Assembly of large genomes using second-generation sequencing [J].
Schatz, Michael C. ;
Delcher, Arthur L. ;
Salzberg, Steven L. .
GENOME RESEARCH, 2010, 20 (09) :1165-1173
[28]   Next-generation DNA sequencing [J].
Shendure, Jay ;
Ji, Hanlee .
NATURE BIOTECHNOLOGY, 2008, 26 (10) :1135-1145
[29]   Diversity of Human Copy Number Variation and Multicopy Genes [J].
Sudmant, Peter H. ;
Kitzman, Jacob O. ;
Antonacci, Francesca ;
Alkan, Can ;
Malig, Maika ;
Tsalenko, Anya ;
Sampas, Nick ;
Bruhn, Laurakay ;
Shendure, Jay ;
Eichler, Evan E. .
SCIENCE, 2010, 330 (6004) :641-646
[30]   Allele-specific DNA methylation: beyond imprinting [J].
Tycko, Benjamin .
HUMAN MOLECULAR GENETICS, 2010, 19 :R210-R220