A highly annotated whole-genome sequence of a Korean individual

被引:240
作者
Kim, Jong-Il [1 ,2 ,4 ,5 ]
Ju, Young Seok [1 ,2 ]
Park, Hansoo [1 ,5 ]
Kim, Sheehyun [4 ]
Lee, Seonwook [4 ]
Yi, Jae-Hyuk [1 ]
Mudge, Joann [6 ]
Miller, Neil A. [6 ]
Hong, Dongwan [1 ]
Bell, Callum J. [6 ]
Kim, Hye-Sun [4 ]
Chung, In-Soon [4 ]
Lee, Woo-Chung [4 ]
Lee, Ji-Sun [4 ]
Seo, Seung-Hyun [5 ]
Yun, Ji-Young [5 ]
Woo, Hyun Nyun [4 ]
Lee, Heewook [4 ]
Suh, Dongwhan [1 ,2 ,3 ]
Lee, Seungbok [1 ,2 ,3 ]
Kim, Hyun-Jin [1 ,3 ]
Yavartanoo, Maryam [1 ,2 ]
Kwak, Minhye [1 ,2 ]
Zheng, Ying [1 ,2 ]
Lee, Mi Kyeong [5 ]
Park, Hyunjun [1 ]
Kim, Jeong Yeon [1 ]
Gokcumen, Omer [7 ]
Mills, Ryan E. [6 ,7 ]
Zaranek, Alexander Wait [8 ]
Thakuria, Joseph [8 ]
Wu, Xiaodi [8 ]
Kim, Ryan W.
Huntley, Jim J. [9 ]
Luo, Shujun [9 ]
Schroth, Gary P. [9 ]
Wu, Thomas D. [10 ]
Kim, HyeRan [4 ]
Yang, Kap-Seok [4 ]
Park, Woong-Yang [1 ,2 ,3 ]
Kim, Hyungtae [4 ]
Church, George M. [8 ]
Lee, Charles [7 ]
Kingsmore, Stephen F. [6 ]
Seo, Jeong-Sun [1 ,2 ,3 ,4 ,5 ]
机构
[1] Seoul Natl Univ, Med Res Ctr, Genom Med Inst, Seoul 110799, South Korea
[2] Seoul Natl Univ, Grad Sch, Coll Med, Dept Biochem & Mol Biol, Seoul 110799, South Korea
[3] Seoul Natl Univ, Grad Sch, Dept Biomed Sci, Seoul 110799, South Korea
[4] Macrogen Inc, Seoul 153023, South Korea
[5] Psoma Therapeut Inc, Seoul 110799, South Korea
[6] Natl Ctr Genome Resources, Santa Fe, NM 87505 USA
[7] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[8] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
[9] Illumina Inc, Hayward, CA 94545 USA
[10] Genentech Inc, Dept Bioinformat, San Francisco, CA 94080 USA
基金
美国国家卫生研究院;
关键词
MESSENGER-RNA;
D O I
10.1038/nature08211
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China(1-4). Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shot-gun sequencing (27.8x coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades(5,6), disclosed nearly 3.45 million single nucleotide polymorphisms ( SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.
引用
收藏
页码:1011 / U96
页数:6
相关论文
共 22 条
[1]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[2]   Dynamic building of a BAC clone tiling path for the Rat Genome Sequencing Project [J].
Chen, R ;
Sodergren, E ;
Weinstock, GM ;
Gibbs, RA .
GENOME RESEARCH, 2004, 14 (04) :679-684
[3]   Construction of a bacterial artificial chromosome library containing large EcoRI and HindIII genomic fragments of lettuce [J].
Frijters, ACJ ;
Zhang, Z ;
vanDamme, M ;
Wang, GL ;
Ronald, PC ;
Michelmore, RW .
THEORETICAL AND APPLIED GENETICS, 1997, 94 (3-4) :390-399
[4]   Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution [J].
Hardison, RC ;
Roskin, KM ;
Yang, S ;
Diekhans, M ;
Kent, WJ ;
Weber, R ;
Elnitski, L ;
Li, J ;
O'Connor, M ;
Kolbe, D ;
Schwartz, S ;
Furey, TS ;
Whelan, S ;
Goldman, N ;
Smit, A ;
Miller, W ;
Chiaromonte, F ;
Haussler, D .
GENOME RESEARCH, 2003, 13 (01) :13-26
[5]   Evidence for natural selection on leukocyte immunoglobulin-like receptors for HLA class I in Northeast Asians [J].
Hirayasu, Kouyuki ;
Ohashi, Jun ;
Tanaka, Hidenori ;
Kashiwase, Koichi ;
Ogawa, Atsuko ;
Takanashi, Minoko ;
Satake, Masahiro ;
Jia, Guan Jun ;
Chimge, Nyam-Osor ;
Sideltseva, Elena W. ;
Tokunaga, Katsushi ;
Yabe, Toshio .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (05) :1075-1083
[6]   Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases [J].
Kondrashov, AS .
HUMAN MUTATION, 2003, 21 (01) :12-27
[7]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[8]   Genetic Risk Factors for Rheumatoid Arthritis Differ in Caucasian and Korean Populations [J].
Lee, Hye-Soon ;
Korman, Benjamin D. ;
Le, Julie M. ;
Kastner, Daniel L. ;
Remmers, Elaine F. ;
Gregersen, Peter K. ;
Bae, Sang-Cheol .
ARTHRITIS AND RHEUMATISM, 2009, 60 (02) :364-371
[9]   The diploid genome sequence of an individual human [J].
Levy, Samuel ;
Sutton, Granger ;
Ng, Pauline C. ;
Feuk, Lars ;
Halpern, Aaron L. ;
Walenz, Brian P. ;
Axelrod, Nelson ;
Huang, Jiaqi ;
Kirkness, Ewen F. ;
Denisov, Gennady ;
Lin, Yuan ;
MacDonald, Jeffrey R. ;
Pang, Andy Wing Chun ;
Shago, Mary ;
Stockwell, Timothy B. ;
Tsiamouri, Alexia ;
Bafna, Vineet ;
Bansal, Vikas ;
Kravitz, Saul A. ;
Busam, Dana A. ;
Beeson, Karen Y. ;
Mclntosh, Tina C. ;
Remington, Karin A. ;
Abril, Josep F. ;
Gill, John ;
Borman, Jon ;
Rogers, Yu-Hui ;
Frazier, Marvin E. ;
Scherer, Stephen W. ;
Strausberg, Robert L. ;
Venter, J. Craig .
PLOS BIOLOGY, 2007, 5 (10) :2113-2144
[10]   In polymorphic genomic regions indels cluster with nucleotide polymorphism: Quantum Genomics [J].
Longman-Jacobsen, N ;
Williamson, JF ;
Dawkins, RL ;
Gaudieri, S .
GENE, 2003, 312 :257-261