Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals

Cited by: 110
Authors
Ju, Young Seok [1, 2]
Kim, Jong-Il [1, 3, 4, 5]
Kim, Sheehyun [1, 2]
Hong, Dongwan [1]
Park, Hansoo [1, 6, 7]
Shin, Jong-Yeon [1, 5]
Lee, Seungbok [1, 4]
Lee, Won-Chul [1, 4]
Kim, Sujung [5]
Yu, Saet-Byeol [5]
Park, Sung-Soo [5]
Seo, Seung-Hyun [5]
Yun, Ji-Young [5]
Kim, Hyun-Jin [1, 4]
Lee, Dong-Sung [1, 4]
Yavartanoo, Maryam [1, 4]
Kang, Hyunseok Peter [1]
Gokcumen, Omer [6, 7]
Govindaraju, Diddahally R. [6, 7]
Jung, Jung Hee [2]
Chong, Hyonyong [2, 8]
Yang, Kap-Seok [2]
Kim, Hyungtae [2]
Lee, Charles [6, 7]
Seo, Jeong-Sun [1, 2, 3, 4, 5, 8]
Affiliations
[1] Seoul Natl Univ, Med Res Ctr, GMI, Seoul, South Korea
[2] Macrogen Inc, Seoul, South Korea
[3] Seoul Natl Univ, Coll Med, Dept Biochem, Seoul, South Korea
[4] Seoul Natl Univ, Grad Sch, Dept Biomed Sci, Seoul, South Korea
[5] Psoma Therapeut Inc, Seoul, South Korea
[6] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Univ, Sch Med, Boston, MA USA
[8] Axeq Technol, Rockville, MD USA
Funding
US National Institutes of Health;
Keywords
GENE-EXPRESSION; STRUCTURAL VARIANTS; EDITING SITES; FAMILY; COMMON; SNP;
DOI
10.1038/ng.872
Chinese Library Classification number
Q3 [Genetics];
Discipline classification codes
071007; 090102;
Abstract
Massively parallel sequencing technologies have identified a broad spectrum of human genome diversity. Here we deep sequenced and correlated 18 genomes and 17 transcriptomes of unrelated Korean individuals. This has allowed us to construct a genome-wide map of common and rare variants and also identify variants formed during DNA-RNA transcription. We identified 9.56 million genomic variants, 23.2% of which appear to be previously unidentified. From transcriptome sequencing, we discovered 4,414 transcripts not previously annotated. Finally, we revealed 1,809 sites of transcriptional base modification, where the transcriptional landscape is different from the corresponding genomic sequences, and 580 sites of allele-specific expression. Our findings suggest that a considerable number of unexplored genomic variants still remain to be identified in the human genome, and that the integrated analysis of genome and transcriptome sequencing is powerful for understanding the diversity and functional aspects of human genomic variants.
Pages: 745 / U47
Number of pages: 10