Haplotype-resolved genome sequencing of a Gujarati Indian individual

被引:168
作者
Kitzman, Jacob O. [1 ]
MacKenzie, Alexandra P. [1 ]
Adey, Andrew [1 ]
Hiatt, Joseph B. [1 ]
Patwardhan, Rupali P. [1 ]
Sudmant, Peter H. [1 ]
Ng, Sarah B. [1 ]
Alkan, Can [1 ,2 ]
Qiu, Ruolan [1 ]
Eichler, Evan E. [1 ,2 ]
Shendure, Jay [1 ]
机构
[1] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA 98195 USA
[2] Howard Hughes Med Inst, Seattle, WA USA
基金
美国国家卫生研究院; 美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
STRUCTURAL VARIATION; COPY-NUMBER; SHORT-READ; ALGORITHMS;
D O I
10.1038/nbt.1740
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Haplotype information is essential to the complete description and interpretation of genomes(1), genetic diversity(2) and genetic ancestry(3). Although individual human genome sequencing is increasingly routine(4), nearly all such genomes are unresolved with respect to haplotype. Here we combine the throughput of massively parallel sequencing(5) with the contiguity information provided by large-insert cloning(6) to experimentally determine the haplotype-resolved genome of a South Asian individual. A single fosmid library was split into a modest number of pools, each providing similar to 3% physical coverage of the diploid genome. Sequencing of each pool yielded reads overwhelmingly derived from only one homologous chromosome at any given location. These data were combined with whole-genome shotgun sequence to directly phase 94% of ascertained heterozygous single nucleotide polymorphisms (SNPs) into long haplotype blocks (N50 of 386 kilobases (kbp)). This method also facilitates the analysis of structural variation, for example, to anchor novel insertions(7,8) to specific locations and haplotypes.
引用
收藏
页码:59 / +
页数:6
相关论文
共 33 条
[1]   Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition [J].
Adey, Andrew ;
Morrison, Hilary G. ;
Asan ;
Xun, Xu ;
Kitzman, Jacob O. ;
Turner, Emily H. ;
Stackhouse, Bethany ;
MacKenzie, Alexandra P. ;
Caruccio, Nicholas C. ;
Zhang, Xiuqing ;
Shendure, Jay .
GENOME BIOLOGY, 2010, 11 (12)
[2]   Personalized copy number and segmental duplication maps using next-generation sequencing [J].
Alkan, Can ;
Kidd, Jeffrey M. ;
Marques-Bonet, Tomas ;
Aksay, Gozde ;
Antonacci, Francesca ;
Hormozdiari, Fereydoun ;
Kitzman, Jacob O. ;
Baker, Carl ;
Malig, Maika ;
Mutlu, Onur ;
Sahinalp, S. Cenk ;
Gibbs, Richard A. ;
Eichler, Evan E. .
NATURE GENETICS, 2009, 41 (10) :1061-U29
[3]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[4]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[5]   Genomes by the thousand [J].
不详 .
NATURE, 2010, 467 (7319) :1026-1027
[6]   HapCUT: an efficient and accurate algorithm for the haplotype assembly problem [J].
Bansal, Vikas ;
Bafna, Vineet .
BIOINFORMATICS, 2008, 24 (16) :I153-I159
[7]   An MCMC algorithm for haplotype assembly from whole-genome sequence data [J].
Bansal, Vikas ;
Halpern, Aaron L. ;
Axelrod, Nelson ;
Bafna, Vineet .
GENOME RESEARCH, 2008, 18 (08) :1336-1346
[8]   Complex promoter and coding region β2-adrenergic receptor haplotypes alter receptor expression and predict in vivo responsiveness [J].
Drysdale, CM ;
McGraw, DW ;
Stack, CB ;
Stephens, JC ;
Judson, RS ;
Nandabalan, K ;
Arnold, K ;
Ruano, G ;
Liggett, SB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (19) :10483-10488
[9]   A Draft Sequence of the Neandertal Genome [J].
Green, Richard E. ;
Krause, Johannes ;
Briggs, Adrian W. ;
Maricic, Tomislav ;
Stenzel, Udo ;
Kircher, Martin ;
Patterson, Nick ;
Li, Heng ;
Zhai, Weiwei ;
Fritz, Markus Hsi-Yang ;
Hansen, Nancy F. ;
Durand, Eric Y. ;
Malaspinas, Anna-Sapfo ;
Jensen, Jeffrey D. ;
Marques-Bonet, Tomas ;
Alkan, Can ;
Pruefer, Kay ;
Meyer, Matthias ;
Burbano, Hernan A. ;
Good, Jeffrey M. ;
Schultz, Rigo ;
Aximu-Petri, Ayinuer ;
Butthof, Anne ;
Hoeber, Barbara ;
Hoeffner, Barbara ;
Siegemund, Madlen ;
Weihmann, Antje ;
Nusbaum, Chad ;
Lander, Eric S. ;
Russ, Carsten ;
Novod, Nathaniel ;
Affourtit, Jason ;
Egholm, Michael ;
Verna, Christine ;
Rudan, Pavao ;
Brajkovic, Dejana ;
Kucan, Zeljko ;
Gusic, Ivan ;
Doronichev, Vladimir B. ;
Golovanova, Liubov V. ;
Lalueza-Fox, Carles ;
de la Rasilla, Marco ;
Fortea, Javier ;
Rosas, Antonio ;
Schmitz, Ralf W. ;
Johnson, Philip L. F. ;
Eichler, Evan E. ;
Falush, Daniel ;
Birney, Ewan ;
Mullikin, James C. .
SCIENCE, 2010, 328 (5979) :710-722
[10]   Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery [J].
Hormozdiari, Fereydoun ;
Hajirasouliha, Iman ;
Dao, Phuong ;
Hach, Faraz ;
Yorukoglu, Deniz ;
Alkan, Can ;
Eichler, Evan E. ;
Sahinalp, S. Cenk .
BIOINFORMATICS, 2010, 26 (12) :i350-i357