Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals

被引:110
作者
Ju, Young Seok [1 ,2 ]
Kim, Jong-Il [1 ,3 ,4 ,5 ]
Kim, Sheehyun [1 ,2 ]
Hong, Dongwan [1 ]
Park, Hansoo [1 ,6 ,7 ]
Shin, Jong-Yeon [1 ,5 ]
Lee, Seungbok [1 ,4 ]
Lee, Won-Chul [1 ,4 ]
Kim, Sujung [5 ]
Yu, Saet-Byeol [5 ]
Park, Sung-Soo [5 ]
Seo, Seung-Hyun [5 ]
Yun, Ji-Young [5 ]
Kim, Hyun-Jin [1 ,4 ]
Lee, Dong-Sung [1 ,4 ]
Yavartanoo, Maryam [1 ,4 ]
Kang, Hyunseok Peter [1 ]
Gokcumen, Omer [6 ,7 ]
Govindaraju, Diddahally R. [6 ,7 ]
Jung, Jung Hee [2 ]
Chong, Hyonyong [2 ,8 ]
Yang, Kap-Seok [2 ]
Kim, Hyungtae [2 ]
Lee, Charles [6 ,7 ]
Seo, Jeong-Sun [1 ,2 ,3 ,4 ,5 ,8 ]
机构
[1] Seoul Natl Univ, Med Res Ctr, GMI, Seoul, South Korea
[2] Macrogen Inc, Seoul, South Korea
[3] Seoul Natl Univ, Coll Med, Dept Biochem, Seoul, South Korea
[4] Seoul Natl Univ, Grad Sch, Dept Biomed Sci, Seoul, South Korea
[5] Psoma Therapeut Inc, Seoul, South Korea
[6] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Univ, Sch Med, Boston, MA USA
[8] Axeq Technol, Rockville, MD USA
基金
美国国家卫生研究院;
关键词
GENE-EXPRESSION; STRUCTURAL VARIANTS; EDITING SITES; FAMILY; COMMON; SNP;
D O I
10.1038/ng.872
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Massively parallel sequencing technologies have identified a broad spectrum of human genome diversity. Here we deep sequenced and correlated 18 genomes and 17 transcriptomes of unrelated Korean individuals. This has allowed us to construct a genome-wide map of common and rare variants and also identify variants formed during DNA-RNA transcription. We identified 9.56 million genomic variants, 23.2% of which appear to be previously unidentified. From transcriptome sequencing, we discovered 4,414 transcripts not previously annotated. Finally, we revealed 1,809 sites of transcriptional base modification, where the transcriptional landscape is different from the corresponding genomic sequences, and 580 sites of allele-specific expression. Our findings suggest that a considerable number of unexplored genomic variants still remain to be identified in the human genome, and that the integrated analysis of genome and transcriptome sequencing is powerful for understanding the diversity and functional aspects of human genomic variants.
引用
收藏
页码:745 / U47
页数:10
相关论文
共 50 条
[41]   Complete Khoisan and Bantu genomes from southern Africa [J].
Schuster, Stephan C. ;
Miller, Webb ;
Ratan, Aakrosh ;
Tomsho, Lynn P. ;
Giardine, Belinda ;
Kasson, Lindsay R. ;
Harris, Robert S. ;
Petersen, Desiree C. ;
Zhao, Fangqing ;
Qi, Ji ;
Alkan, Can ;
Kidd, Jeffrey M. ;
Sun, Yazhou ;
Drautz, Daniela I. ;
Bouffard, Pascal ;
Muzny, Donna M. ;
Reid, Jeffrey G. ;
Nazareth, Lynne V. ;
Wang, Qingyu ;
Burhans, Richard ;
Riemer, Cathy ;
Wittekindt, Nicola E. ;
Moorjani, Priya ;
Tindall, Elizabeth A. ;
Danko, Charles G. ;
Teo, Wee Siang ;
Buboltz, Anne M. ;
Zhang, Zhenhai ;
Ma, Qianyi ;
Oosthuysen, Arno ;
Steenkamp, Abraham W. ;
Oostuisen, Hermann ;
Venter, Philippus ;
Gajewski, John ;
Zhang, Yu ;
Pugh, B. Franklin ;
Makova, Kateryna D. ;
Nekrutenko, Anton ;
Mardis, Elaine R. ;
Patterson, Nick ;
Pringle, Tom H. ;
Chiaromonte, Francesca ;
Mullikin, James C. ;
Eichler, Evan E. ;
Hardison, Ross C. ;
Gibbs, Richard A. ;
Harkins, Timothy T. ;
Hayes, Vanessa M. .
NATURE, 2010, 463 (7283) :943-947
[42]   ABySS: A parallel assembler for short read sequence data [J].
Simpson, Jared T. ;
Wong, Kim ;
Jackman, Shaun D. ;
Schein, Jacqueline E. ;
Jones, Steven J. M. ;
Birol, Inanc .
GENOME RESEARCH, 2009, 19 (06) :1117-1123
[43]   RNA-sequence analysis of human B-cells [J].
Toung, Jonathan M. ;
Morley, Michael ;
Li, Mingyao ;
Cheung, Vivian G. .
GENOME RESEARCH, 2011, 21 (06) :991-998
[44]   The sequence of the human genome [J].
Venter, JC ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Mural, RJ ;
Sutton, GG ;
Smith, HO ;
Yandell, M ;
Evans, CA ;
Holt, RA ;
Gocayne, JD ;
Amanatides, P ;
Ballew, RM ;
Huson, DH ;
Wortman, JR ;
Zhang, Q ;
Kodira, CD ;
Zheng, XQH ;
Chen, L ;
Skupski, M ;
Subramanian, G ;
Thomas, PD ;
Zhang, JH ;
Miklos, GLG ;
Nelson, C ;
Broder, S ;
Clark, AG ;
Nadeau, C ;
McKusick, VA ;
Zinder, N ;
Levine, AJ ;
Roberts, RJ ;
Simon, M ;
Slayman, C ;
Hunkapiller, M ;
Bolanos, R ;
Delcher, A ;
Dew, I ;
Fasulo, D ;
Flanigan, M ;
Florea, L ;
Halpern, A ;
Hannenhalli, S ;
Kravitz, S ;
Levy, S ;
Mobarry, C ;
Reinert, K ;
Remington, K ;
Abu-Threideh, J ;
Beasley, E .
SCIENCE, 2001, 291 (5507) :1304-+
[45]   The diploid genome sequence of an Asian individual [J].
Wang, Jun ;
Wang, Wei ;
Li, Ruiqiang ;
Li, Yingrui ;
Tian, Geng ;
Goodman, Laurie ;
Fan, Wei ;
Zhang, Junqing ;
Li, Jun ;
Zhang, Juanbin ;
Guo, Yiran ;
Feng, Binxiao ;
Li, Heng ;
Lu, Yao ;
Fang, Xiaodong ;
Liang, Huiqing ;
Du, Zhenglin ;
Li, Dong ;
Zhao, Yiqing ;
Hu, Yujie ;
Yang, Zhenzhen ;
Zheng, Hancheng ;
Hellmann, Ines ;
Inouye, Michael ;
Pool, John ;
Yi, Xin ;
Zhao, Jing ;
Duan, Jinjie ;
Zhou, Yan ;
Qin, Junjie ;
Ma, Lijia ;
Li, Guoqing ;
Yang, Zhentao ;
Zhang, Guojie ;
Yang, Bin ;
Yu, Chang ;
Liang, Fang ;
Li, Wenjie ;
Li, Shaochuan ;
Li, Dawei ;
Ni, Peixiang ;
Ruan, Jue ;
Li, Qibin ;
Zhu, Hongmei ;
Liu, Dongyuan ;
Lu, Zhike ;
Li, Ning ;
Guo, Guangwu ;
Zhang, Jianguo ;
Ye, Jia .
NATURE, 2008, 456 (7218) :60-U1
[46]   The complete genome of an individual by massively parallel DNA sequencing [J].
Wheeler, David A. ;
Srinivasan, Maithreyan ;
Egholm, Michael ;
Shen, Yufeng ;
Chen, Lei ;
McGuire, Amy ;
He, Wen ;
Chen, Yi-Ju ;
Makhijani, Vinod ;
Roth, G. Thomas ;
Gomes, Xavier ;
Tartaro, Karrie ;
Niazi, Faheem ;
Turcotte, Cynthia L. ;
Irzyk, Gerard P. ;
Lupski, James R. ;
Chinault, Craig ;
Song, Xing-zhi ;
Liu, Yue ;
Yuan, Ye ;
Nazareth, Lynne ;
Qin, Xiang ;
Muzny, Donna M. ;
Margulies, Marcel ;
Weinstock, George M. ;
Gibbs, Richard A. ;
Rothberg, Jonathan M. .
NATURE, 2008, 452 (7189) :872-U5
[47]   Fast and SNP-tolerant detection of complex variants and splicing in short reads [J].
Wu, Thomas D. ;
Nacu, Serban .
BIOINFORMATICS, 2010, 26 (07) :873-881
[48]   Elucidating the inosinome: global approaches to adenosine-to-inosine RNA editing [J].
Wulff, Bjorn-Erik ;
Sakurai, Masayuki ;
Nishikura, Kazuko .
NATURE REVIEWS GENETICS, 2011, 12 (02) :81-85
[49]   A SNP in the ABCC11 gene is the determinant of human earwax type [J].
Yoshiura, K ;
Kinoshita, A ;
Ishida, T ;
Ninokata, A ;
Ishikawa, T ;
Kaname, T ;
Bannai, M ;
Tokunaga, K ;
Sonoda, S ;
Komaki, R ;
Ihara, M ;
Saenko, VA ;
Alipov, GK ;
Sekine, I ;
Komatsu, K ;
Takahashi, H ;
Nakashima, M ;
Sosonkina, N ;
Mapendano, CK ;
Ghadami, M ;
Nomura, M ;
Liang, DS ;
Miwa, N ;
Kim, DK ;
Garidkhuu, A ;
Natsume, N ;
Ohta, T ;
Tomita, H ;
Kaneko, A ;
Kikuchi, M ;
Russomando, G ;
Hirayama, K ;
Ishibashi, M ;
Takahashi, A ;
Saitou, N ;
Murray, JC ;
Saito, S ;
Nakamura, Y ;
Niikawa, N .
NATURE GENETICS, 2006, 38 (03) :324-330
[50]   Cancer resistance in transgenic mice expressing the SAC module of par-4 [J].
Zhao, Yanming ;
Burikhanov, Ravshan ;
Qiu, Shirley ;
Lele, Subodh M. ;
Jennings, C. Darrell ;
Bondada, Subbarao ;
Spear, Brett ;
Rangnekar, Vivek M. .
CANCER RESEARCH, 2007, 67 (19) :9276-9285