A comprehensively molecular haplotype-resolved genome of a European individual

Cited by: 56
Authors
Suk, Eun-Kyung [1]
McEwen, Gayle K. [1]
Duitama, Jorge [1]
Nowick, Katja [1]
Schulz, Sabrina [1]
Palczewski, Stefanie [1]
Schreiber, Stefan [2]
Holloway, Dustin T. [3]
McLaughlin, Stephen [3]
Peckham, Heather [3]
Lee, Clarence [3]
Huebsch, Thomas [1]
Hoehe, Margret R. [1]
Affiliations
[1] Max Planck Inst Mol Genet, Dept Vertebrate Genom, D-14195 Berlin, Germany
[2] Univ Kiel, Inst Clin Mol Biol, D-24105 Kiel, Germany
[3] Life Technol, Beverly, MA 01915 USA
Keywords
ALLELE-SPECIFIC EXPRESSION; GENE-EXPRESSION; SEQUENCE; DISEASE; PROTEINS; BRCA1; PHASE
DOI
10.1101/gr.125047.111
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Independent determination of both haplotype sequences of an individual genome is essential to relate genetic variation to genome function, phenotype, and disease. To address the importance of phase, we have generated the most complete haplotype-resolved genome to date, "Max Planck One" (MP1), by fosmid pool-based next-generation sequencing. Virtually all SNPs (>99%) and 80,000 indels were phased into haploid sequences of up to 6.3 Mb (N50 ~1 Mb). The completeness of phasing allowed determination of the concrete molecular haplotype pairs for the vast majority of genes (81%) including potential regulatory sequences, of which >90% were found to be constituted by two different molecular forms. A subset of 159 genes with potentially severe mutations in either cis or trans configurations exemplified in particular the role of phase for gene function, disease, and clinical interpretation of personal genomes (e.g., BRCA1). Extended genomic regions harboring manifold combinations of physically and/or functionally related genes and regulatory elements were resolved into their underlying "haploid landscapes," which may define the functional genome. Moreover, the majority of genes and functional sequences were found to contain individual or rare SNPs, which cannot be phased from population data alone, emphasizing the importance of molecular phasing for characterizing a genome in its molecular individuality. Our work provides the foundation to understand that the distinction of molecular haplotypes is essential to resolve the (inherently individual) biology of genes, genomes, and disease, establishing a reference point for "phase-sensitive" personal genomics. MP1's annotated haploid genomes are available as a public resource.
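The abstract relies on two notions that a small sketch may make concrete: the N50 of the phased haploid sequences, and the cis/trans configuration of two heterozygous mutations in one gene. The Python below is an illustrative sketch only, not the authors' pipeline. The function names `n50` and `configuration` are hypothetical, and the VCF-style phased genotype strings ("0|1", "1|0") are an assumed input convention that the record itself does not specify.

```python
def n50(block_lengths):
    """Return the N50: the largest length L such that blocks of
    length >= L together cover at least half of the total length."""
    total = sum(block_lengths)
    running = 0
    for length in sorted(block_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no block lengths given")


def configuration(gt_a, gt_b):
    """Classify two phased heterozygous biallelic genotypes as cis or trans.

    Assumption: both genotypes come from the same phase block and use the
    form 'h1|h2', where '0' is the reference allele and '1' the mutation.
    """
    a = gt_a.split("|")
    b = gt_b.split("|")
    for gt in (a, b):
        # Exactly one '0' and one '1' means phased, diploid, heterozygous.
        if sorted(gt) != ["0", "1"]:
            raise ValueError("expected a phased heterozygous genotype like '0|1'")
    # Same haplotype carries both mutations -> cis; opposite haplotypes -> trans.
    return "cis" if a == b else "trans"


if __name__ == "__main__":
    print(n50([5, 4, 3, 2, 1]))
    print(configuration("0|1", "0|1"))
    print(configuration("0|1", "1|0"))
```

In the trans case each copy of the gene carries one mutation, so no unaffected copy remains; in the cis case one copy stays intact. Unphased genotypes cannot separate the two cases, which is the distinction the abstract highlights for the 159 genes with potentially severe mutations.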
Pages: 1672-1685
Number of pages: 14