A highly annotated whole-genome sequence of a Korean individual

被引:240
作者
Kim, Jong-Il [1 ,2 ,4 ,5 ]
Ju, Young Seok [1 ,2 ]
Park, Hansoo [1 ,5 ]
Kim, Sheehyun [4 ]
Lee, Seonwook [4 ]
Yi, Jae-Hyuk [1 ]
Mudge, Joann [6 ]
Miller, Neil A. [6 ]
Hong, Dongwan [1 ]
Bell, Callum J. [6 ]
Kim, Hye-Sun [4 ]
Chung, In-Soon [4 ]
Lee, Woo-Chung [4 ]
Lee, Ji-Sun [4 ]
Seo, Seung-Hyun [5 ]
Yun, Ji-Young [5 ]
Woo, Hyun Nyun [4 ]
Lee, Heewook [4 ]
Suh, Dongwhan [1 ,2 ,3 ]
Lee, Seungbok [1 ,2 ,3 ]
Kim, Hyun-Jin [1 ,3 ]
Yavartanoo, Maryam [1 ,2 ]
Kwak, Minhye [1 ,2 ]
Zheng, Ying [1 ,2 ]
Lee, Mi Kyeong [5 ]
Park, Hyunjun [1 ]
Kim, Jeong Yeon [1 ]
Gokcumen, Omer [7 ]
Mills, Ryan E. [6 ,7 ]
Zaranek, Alexander Wait [8 ]
Thakuria, Joseph [8 ]
Wu, Xiaodi [8 ]
Kim, Ryan W.
Huntley, Jim J. [9 ]
Luo, Shujun [9 ]
Schroth, Gary P. [9 ]
Wu, Thomas D. [10 ]
Kim, HyeRan [4 ]
Yang, Kap-Seok [4 ]
Park, Woong-Yang [1 ,2 ,3 ]
Kim, Hyungtae [4 ]
Church, George M. [8 ]
Lee, Charles [7 ]
Kingsmore, Stephen F. [6 ]
Seo, Jeong-Sun [1 ,2 ,3 ,4 ,5 ]
机构
[1] Seoul Natl Univ, Med Res Ctr, Genom Med Inst, Seoul 110799, South Korea
[2] Seoul Natl Univ, Grad Sch, Coll Med, Dept Biochem & Mol Biol, Seoul 110799, South Korea
[3] Seoul Natl Univ, Grad Sch, Dept Biomed Sci, Seoul 110799, South Korea
[4] Macrogen Inc, Seoul 153023, South Korea
[5] Psoma Therapeut Inc, Seoul 110799, South Korea
[6] Natl Ctr Genome Resources, Santa Fe, NM 87505 USA
[7] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[8] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
[9] Illumina Inc, Hayward, CA 94545 USA
[10] Genentech Inc, Dept Bioinformat, San Francisco, CA 94080 USA
基金
美国国家卫生研究院;
关键词
MESSENGER-RNA;
D O I
10.1038/nature08211
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China(1-4). Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shot-gun sequencing (27.8x coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades(5,6), disclosed nearly 3.45 million single nucleotide polymorphisms ( SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.
引用
收藏
页码:1011 / U96
页数:6
相关论文
共 22 条
  • [1] Accurate whole human genome sequencing using reversible terminator chemistry
    Bentley, David R.
    Balasubramanian, Shankar
    Swerdlow, Harold P.
    Smith, Geoffrey P.
    Milton, John
    Brown, Clive G.
    Hall, Kevin P.
    Evers, Dirk J.
    Barnes, Colin L.
    Bignell, Helen R.
    Boutell, Jonathan M.
    Bryant, Jason
    Carter, Richard J.
    Cheetham, R. Keira
    Cox, Anthony J.
    Ellis, Darren J.
    Flatbush, Michael R.
    Gormley, Niall A.
    Humphray, Sean J.
    Irving, Leslie J.
    Karbelashvili, Mirian S.
    Kirk, Scott M.
    Li, Heng
    Liu, Xiaohai
    Maisinger, Klaus S.
    Murray, Lisa J.
    Obradovic, Bojan
    Ost, Tobias
    Parkinson, Michael L.
    Pratt, Mark R.
    Rasolonjatovo, Isabelle M. J.
    Reed, Mark T.
    Rigatti, Roberto
    Rodighiero, Chiara
    Ross, Mark T.
    Sabot, Andrea
    Sankar, Subramanian V.
    Scally, Aylwyn
    Schroth, Gary P.
    Smith, Mark E.
    Smith, Vincent P.
    Spiridou, Anastassia
    Torrance, Peta E.
    Tzonev, Svilen S.
    Vermaas, Eric H.
    Walter, Klaudia
    Wu, Xiaolin
    Zhang, Lu
    Alam, Mohammed D.
    Anastasi, Carole
    [J]. NATURE, 2008, 456 (7218) : 53 - 59
  • [2] Dynamic building of a BAC clone tiling path for the Rat Genome Sequencing Project
    Chen, R
    Sodergren, E
    Weinstock, GM
    Gibbs, RA
    [J]. GENOME RESEARCH, 2004, 14 (04) : 679 - 684
  • [3] Construction of a bacterial artificial chromosome library containing large EcoRI and HindIII genomic fragments of lettuce
    Frijters, ACJ
    Zhang, Z
    vanDamme, M
    Wang, GL
    Ronald, PC
    Michelmore, RW
    [J]. THEORETICAL AND APPLIED GENETICS, 1997, 94 (3-4) : 390 - 399
  • [4] Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution
    Hardison, RC
    Roskin, KM
    Yang, S
    Diekhans, M
    Kent, WJ
    Weber, R
    Elnitski, L
    Li, J
    O'Connor, M
    Kolbe, D
    Schwartz, S
    Furey, TS
    Whelan, S
    Goldman, N
    Smit, A
    Miller, W
    Chiaromonte, F
    Haussler, D
    [J]. GENOME RESEARCH, 2003, 13 (01) : 13 - 26
  • [5] Evidence for natural selection on leukocyte immunoglobulin-like receptors for HLA class I in Northeast Asians
    Hirayasu, Kouyuki
    Ohashi, Jun
    Tanaka, Hidenori
    Kashiwase, Koichi
    Ogawa, Atsuko
    Takanashi, Minoko
    Satake, Masahiro
    Jia, Guan Jun
    Chimge, Nyam-Osor
    Sideltseva, Elena W.
    Tokunaga, Katsushi
    Yabe, Toshio
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (05) : 1075 - 1083
  • [6] Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases
    Kondrashov, AS
    [J]. HUMAN MUTATION, 2003, 21 (01) : 12 - 27
  • [7] Initial sequencing and analysis of the human genome
    Lander, ES
    Int Human Genome Sequencing Consortium
    Linton, LM
    Birren, B
    Nusbaum, C
    Zody, MC
    Baldwin, J
    Devon, K
    Dewar, K
    Doyle, M
    FitzHugh, W
    Funke, R
    Gage, D
    Harris, K
    Heaford, A
    Howland, J
    Kann, L
    Lehoczky, J
    LeVine, R
    McEwan, P
    McKernan, K
    Meldrim, J
    Mesirov, JP
    Miranda, C
    Morris, W
    Naylor, J
    Raymond, C
    Rosetti, M
    Santos, R
    Sheridan, A
    Sougnez, C
    Stange-Thomann, N
    Stojanovic, N
    Subramanian, A
    Wyman, D
    Rogers, J
    Sulston, J
    Ainscough, R
    Beck, S
    Bentley, D
    Burton, J
    Clee, C
    Carter, N
    Coulson, A
    Deadman, R
    Deloukas, P
    Dunham, A
    Dunham, I
    Durbin, R
    French, L
    [J]. NATURE, 2001, 409 (6822) : 860 - 921
  • [8] Genetic Risk Factors for Rheumatoid Arthritis Differ in Caucasian and Korean Populations
    Lee, Hye-Soon
    Korman, Benjamin D.
    Le, Julie M.
    Kastner, Daniel L.
    Remmers, Elaine F.
    Gregersen, Peter K.
    Bae, Sang-Cheol
    [J]. ARTHRITIS AND RHEUMATISM, 2009, 60 (02): : 364 - 371
  • [9] The diploid genome sequence of an individual human
    Levy, Samuel
    Sutton, Granger
    Ng, Pauline C.
    Feuk, Lars
    Halpern, Aaron L.
    Walenz, Brian P.
    Axelrod, Nelson
    Huang, Jiaqi
    Kirkness, Ewen F.
    Denisov, Gennady
    Lin, Yuan
    MacDonald, Jeffrey R.
    Pang, Andy Wing Chun
    Shago, Mary
    Stockwell, Timothy B.
    Tsiamouri, Alexia
    Bafna, Vineet
    Bansal, Vikas
    Kravitz, Saul A.
    Busam, Dana A.
    Beeson, Karen Y.
    Mclntosh, Tina C.
    Remington, Karin A.
    Abril, Josep F.
    Gill, John
    Borman, Jon
    Rogers, Yu-Hui
    Frazier, Marvin E.
    Scherer, Stephen W.
    Strausberg, Robert L.
    Venter, J. Craig
    [J]. PLOS BIOLOGY, 2007, 5 (10) : 2113 - 2144
  • [10] In polymorphic genomic regions indels cluster with nucleotide polymorphism: Quantum Genomics
    Longman-Jacobsen, N
    Williamson, JF
    Dawkins, RL
    Gaudieri, S
    [J]. GENE, 2003, 312 : 257 - 261