Mapping and characterization of structural variation in 17,795 human genomes

被引:188
作者
Abel, Haley J. [1 ,2 ]
Larson, David E. [1 ,2 ]
Regier, Allison A. [1 ,3 ]
Chiang, Colby [1 ]
Das, Indraniel [1 ]
Kanchi, Krishna L. [1 ]
Layer, Ryan M. [4 ,5 ]
Neale, Benjamin M. [6 ,7 ,8 ]
Salerno, William J. [9 ]
Reeves, Catherine [10 ]
Buyske, Steven [11 ]
Matise, Tara C. [12 ]
Muzny, Donna M. [9 ]
Zody, Michael C. [10 ]
Lander, Eric S. [6 ,13 ,14 ]
Dutcher, Susan K. [1 ,2 ]
Stitziel, Nathan O. [1 ,2 ,3 ]
Hall, Ira M. [1 ,2 ,3 ]
机构
[1] Washington Univ, Sch Med, McDonnell Genome Inst, St Louis, MO 63110 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63110 USA
[3] Washington Univ, Sch Med, Dept Med, St Louis, MO 63110 USA
[4] Univ Colorado, BioFrontiers Inst, Boulder, CO 80309 USA
[5] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[6] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[7] Broad Inst MIT & Harvard, Stanley Ctr Psychiat Res, Cambridge, MA 02142 USA
[8] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[9] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[10] New York Genome Ctr, New York, NY USA
[11] Rutgers State Univ, Dept Stat, Piscataway, NJ USA
[12] Rutgers State Univ, Dept Genet, Piscataway, NJ USA
[13] MIT, Dept Biol, Cambridge, MA USA
[14] Harvard Med Sch, Dept Syst Biol, Boston, MA 02115 USA
基金
芬兰科学院;
关键词
COPY NUMBER VARIATION; COMMUNITY-DRIVEN RESOURCE; WIDE ASSOCIATION; VARIANTS; IMPACT; ELEMENTS; DATABASE; NUCLEOTIDE; PREDICTION; FRAMEWORK;
D O I
10.1038/s41586-020-2371-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Structural variants in more than 17,000 human genomes are mapped and characterized using whole-genome sequencing, showing how this type of variation contributes to rare deleterious coding and noncoding alleles. A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline(1)to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.
引用
收藏
页码:83 / +
页数:26
相关论文
共 73 条
[1]
CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing [J].
Abyzov, Alexej ;
Urban, Alexander E. ;
Snyder, Michael ;
Gerstein, Mark .
GENOME RESEARCH, 2011, 21 (06) :974-984
[2]
Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division [J].
Abyzov, Alexej ;
Iskow, Rebecca ;
Gokcumen, Omer ;
Radke, David W. ;
Balasubramanian, Suganthi ;
Pei, Baikang ;
Habegger, Lukas ;
Lee, Charles ;
Gerstein, Mark .
GENOME RESEARCH, 2013, 23 (12) :2042-2052
[3]
A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[4]
DENdb: database of integrated human enhancers [J].
Ashoor, Haitham ;
Kleftogiannis, Dimitrios ;
Radovanovic, Aleksandar ;
Bajic, Vladimir B. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2015,
[5]
Ultraconserved elements in the human genome [J].
Bejerano, G ;
Pheasant, M ;
Makunin, I ;
Stephen, S ;
Kent, WJ ;
Mattick, JS ;
Haussler, D .
SCIENCE, 2004, 304 (5675) :1321-1325
[6]
DECIPHER: database for the interpretation of phenotype-linked plausibly pathogenic sequence and copy-number variation [J].
Bragin, Eugene ;
Chatzimichali, Eleni A. ;
Wright, Caroline F. ;
Hurles, Matthew E. ;
Firth, Helen V. ;
Bevan, A. Paul ;
Swaminathan, G. Jawahar .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D993-D1000
[7]
Paternally inherited cis-regulatory structural variants are associated with autism [J].
Brandler, William M. ;
Antaki, Danny ;
Gujral, Madhusudan ;
Kleiber, Morgan L. ;
Whitney, Joe ;
Maile, Michelle S. ;
Hong, Oanh ;
Chapman, Timothy R. ;
Tan, Shirley ;
Tandon, Prateek ;
Pang, Timothy ;
Tang, Shih C. ;
Vaux, Keith K. ;
Yang, Yan ;
Harrington, Eoghan ;
Juul, Sissel ;
Turner, Daniel J. ;
Thiruvahindrapuram, Bhooma ;
Kaur, Gaganjot ;
Wang, Zhuozhi ;
Kingsmore, Stephen F. ;
Gleeson, Joseph G. ;
Bisson, Denis ;
Kakaradov, Boyko ;
Telenti, Amalio ;
Venter, J. Craig ;
Corominas, Roser ;
Toma, Claudio ;
Cormand, Bru ;
Rueda, Isabel ;
Guijarro, Silvina ;
Messer, Karen S. ;
Nievergelt, Caroline M. ;
Arranz, Maria J. ;
Courchesne, Eric ;
Pierce, Karen ;
Muotri, Alysson R. ;
Iakoucheva, Lilia M. ;
Hervas, Amaia ;
Scherer, Stephen W. ;
Corsello, Christina ;
Sebat, Jonathan .
SCIENCE, 2018, 360 (6386) :327-330
[8]
Multi-platform discovery of haplotype-resolved structural variation in human genomes [J].
Chaisson, Mark J. P. ;
Sanders, Ashley D. ;
Zhao, Xuefang ;
Malhotra, Ankit ;
Porubsky, David ;
Rausch, Tobias ;
Gardner, Eugene J. ;
Rodriguez, Oscar L. ;
Guo, Li ;
Collins, Ryan L. ;
Fan, Xian ;
Wen, Jia ;
Handsaker, Robert E. ;
Fairley, Susan ;
Kronenberg, Zev N. ;
Kong, Xiangmeng ;
Hormozdiari, Fereydoun ;
Lee, Dillon ;
Wenger, Aaron M. ;
Hastie, Alex R. ;
Antaki, Danny ;
Anantharaman, Thomas ;
Audano, Peter A. ;
Brand, Harrison ;
Cantsilieris, Stuart ;
Cao, Han ;
Cerveira, Eliza ;
Chen, Chong ;
Chen, Xintong ;
Chin, Chen-Shan ;
Chong, Zechen ;
Chuang, Nelson T. ;
Lambert, Christine C. ;
Church, Deanna M. ;
Clarke, Laura ;
Farrell, Andrew ;
Flores, Joey ;
Galeev, Timur ;
Gorkin, David U. ;
Gujral, Madhusudan ;
Guryev, Victor ;
Heaton, William Haynes ;
Korlach, Jonas ;
Kumar, Sushant ;
Kwon, Jee Young ;
Lam, Ernest T. ;
Lee, Jong Eun ;
Lee, Joyce ;
Lee, Wan-Ping ;
Lee, Sau Peng .
NATURE COMMUNICATIONS, 2019, 10 (1)
[9]
The impact of structural variation on human gene expression [J].
Chiang, Colby ;
Scott, Alexandra J. ;
Davis, Joe R. ;
Tsang, Emily K. ;
Li, Xin ;
Kim, Yungil ;
Hadzic, Tarik ;
Damani, Farhan N. ;
Ganel, Liron ;
Montgomery, Stephen B. ;
Battle, Alexis ;
Conrad, Donald F. ;
Hall, Ira M. .
NATURE GENETICS, 2017, 49 (05) :692-+
[10]
Chiang C, 2015, NAT METHODS, V12, P966, DOI [10.1038/NMETH.3505, 10.1038/nmeth.3505]