The diploid genome sequence of an Asian individual

被引:663
作者
Wang, Jun [1 ,2 ,3 ,4 ]
Wang, Wei [1 ,3 ]
Li, Ruiqiang [1 ,3 ,4 ]
Li, Yingrui [1 ,5 ,6 ]
Tian, Geng [1 ,7 ]
Goodman, Laurie [1 ]
Fan, Wei [1 ]
Zhang, Junqing [1 ]
Li, Jun [1 ]
Zhang, Juanbin [1 ]
Guo, Yiran [1 ,7 ]
Feng, Binxiao [1 ]
Li, Heng [1 ,8 ]
Lu, Yao [1 ]
Fang, Xiaodong [1 ]
Liang, Huiqing [1 ]
Du, Zhenglin [1 ]
Li, Dong [1 ]
Zhao, Yiqing [1 ,7 ]
Hu, Yujie [1 ,7 ]
Yang, Zhenzhen [1 ]
Zheng, Hancheng [1 ]
Hellmann, Ines [9 ,10 ]
Inouye, Michael [8 ]
Pool, John [9 ,10 ]
Yi, Xin [1 ,7 ]
Zhao, Jing [1 ]
Duan, Jinjie [1 ]
Zhou, Yan [1 ]
Qin, Junjie [1 ,7 ]
Ma, Lijia [1 ,7 ]
Li, Guoqing [1 ]
Yang, Zhentao [1 ]
Zhang, Guojie [1 ,7 ]
Yang, Bin [1 ]
Yu, Chang [1 ]
Liang, Fang [1 ,7 ]
Li, Wenjie [1 ]
Li, Shaochuan [1 ]
Li, Dawei [1 ]
Ni, Peixiang [1 ]
Ruan, Jue [1 ,7 ]
Li, Qibin [1 ,7 ]
Zhu, Hongmei [1 ]
Liu, Dongyuan [1 ]
Lu, Zhike [1 ]
Li, Ning [1 ,7 ]
Guo, Guangwu [1 ,7 ]
Zhang, Jianguo [1 ]
Ye, Jia [1 ]
机构
[1] Beijing Genom Inst Shenzhen, Shenzhen 518000, Peoples R China
[2] Shenzhen Univ Med Sch, Genome Res Inst, Shenzhen 518000, Peoples R China
[3] Natl Engn Ctr Genom & Bioinformat, Beijing 101300, Peoples R China
[4] Univ So Denmark, Dept Biochem & Mol Biol, DK-5230 Odense M, Denmark
[5] Peking Univ, Coll Life Sci, Beijing 100871, Peoples R China
[6] Chinese Acad Sci, Beijing Inst Genom, Beijing Genom Inst, Beijing 101300, Peoples R China
[7] Grad Univ Chinese Acad Sci, Beijing 100062, Peoples R China
[8] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[9] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[10] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[11] Univ Alberta, Dept Biol Sci, Edmonton, AB T6G 2E9, Canada
[12] Univ Alberta, Dept Med, Edmonton, AB T6G 2E9, Canada
[13] Aarhus Univ, Inst Human Genet, DK-8000 Aarhus, Denmark
基金
中国国家自然科学基金;
关键词
D O I
10.1038/nature07484
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36- fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high- quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single- nucleotide polymorphisms ( SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes ( Chinese and Japanese, respectively), sequence comparison with the two available individual genomes ( J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next- generation sequencing technologies for personal genomics.
引用
收藏
页码:60 / U1
页数:7
相关论文
共 20 条
[1]   Closing gaps in the human genome with fosmid resources generated from multiple individuals (Reprinted from Nature Genetics, vol 40, pg 96-101, 2008) [J].
Bovee, Donald ;
Zhou, Yang ;
Haugen, Eric ;
Wu, Zaining ;
Hayden, Hillary S. ;
Gillett, Will ;
Tuzun, Eray ;
Cooper, Gregory M. ;
Sampas, Nick ;
Phelps, Karen ;
Levy, Ruth ;
Morrison, V. Anne ;
Sprague, James ;
Jewett, Donald ;
Buckley, Danielle ;
Subramaniam, Sandhya ;
Chang, Jean ;
Smith, Douglas R. ;
Olson, Maynard V. ;
Eichler, Evan E. ;
Kaul, Rajinder .
NATURE GENETICS, 2009, :S31-S36
[2]   The Personal Genome Project [J].
Church, G. M. .
MOLECULAR SYSTEMS BIOLOGY, 2005, 1 (1)
[3]   A high-density whole-genome association study reveals that APOE is the major susceptibility gene for sporadic late-onset Alzheimer's disease [J].
Coon, Keith D. ;
Myers, Amanda J. ;
Craig, David W. ;
Webster, Jennifer A. ;
Pearson, John V. ;
Lince, Diane Hu ;
Zismann, Victoria L. ;
Beach, Thomas G. ;
Leung, Doris ;
Bryden, Leslie ;
Halperin, Rebecca F. ;
Marlowe, Lauren ;
Kaleem, Mona ;
Walker, Douglas G. ;
Ravid, Rivka ;
Heward, Christopher B. ;
Rogers, Joseph ;
Papassotiropoulos, Andreas ;
Reiman, Eric M. ;
Hardy, John ;
Stephan, Dietrich A. .
JOURNAL OF CLINICAL PSYCHIATRY, 2007, 68 (04) :613-618
[4]   Detection of large-scale variation in the human genome [J].
Iafrate, AJ ;
Feuk, L ;
Rivera, MN ;
Listewnik, ML ;
Donahoe, PK ;
Qi, Y ;
Scherer, SW ;
Lee, C .
NATURE GENETICS, 2004, 36 (09) :949-951
[5]   Mapping and sequencing of structural variation from eight human genomes (Reprinted from Nature, vol 453, pg 56-64, 2008) [J].
Kidd, Jeffrey M. ;
Cooper, Gregory M. ;
Donahue, William F. ;
Hayden, Hillary S. ;
Sampas, Nick ;
Graves, Tina ;
Hansen, Nancy ;
Teague, Brian ;
Alkan, Can ;
Antonacci, Francesca ;
Haugen, Eric ;
Zerr, Troy ;
Yamada, N. Alice ;
Tsang, Peter ;
Newman, Tera L. ;
Tuzun, Eray ;
Cheng, Ze ;
Ebling, Heather M. ;
Tusneem, Nadeem ;
David, Robert ;
Gillett, Will ;
Phelps, Karen A. ;
Weaver, Molly ;
Saranga, David ;
Brand, Adrianne ;
Tao, Wei ;
Gustafson, Erik ;
McKernan, Kevin ;
Chen, Lin ;
Malig, Maika ;
Smith, Joshua D. ;
Korn, Joshua M. ;
McCarroll, Steven A. ;
Altshuler, David A. ;
Peiffer, Daniel A. ;
Dorschner, Michael ;
Stamatoyannopoulos, John ;
Schwartz, David ;
Nickerson, Deborah A. ;
Mullikin, James C. ;
Wilson, Richard K. ;
Bruhn, Laurakay ;
Olson, Maynard V. ;
Kaul, Rajinder ;
Smith, Douglas R. ;
Eichler, Evan E. .
NATURE GENETICS, 2009, :S22-S30
[6]  
Korbel JO, 2007, SCIENCE, V318, P420, DOI 10.1126/science.1149504
[7]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[8]   The diploid genome sequence of an individual human [J].
Levy, Samuel ;
Sutton, Granger ;
Ng, Pauline C. ;
Feuk, Lars ;
Halpern, Aaron L. ;
Walenz, Brian P. ;
Axelrod, Nelson ;
Huang, Jiaqi ;
Kirkness, Ewen F. ;
Denisov, Gennady ;
Lin, Yuan ;
MacDonald, Jeffrey R. ;
Pang, Andy Wing Chun ;
Shago, Mary ;
Stockwell, Timothy B. ;
Tsiamouri, Alexia ;
Bafna, Vineet ;
Bansal, Vikas ;
Kravitz, Saul A. ;
Busam, Dana A. ;
Beeson, Karen Y. ;
Mclntosh, Tina C. ;
Remington, Karin A. ;
Abril, Josep F. ;
Gill, John ;
Borman, Jon ;
Rogers, Yu-Hui ;
Frazier, Marvin E. ;
Scherer, Stephen W. ;
Strausberg, Robert L. ;
Venter, J. Craig .
PLOS BIOLOGY, 2007, 5 (10) :2113-2144
[9]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714
[10]   Mendelian inheritance in man and its online version, OMIM [J].
McKusick, Victor A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (04) :588-604