The mutational constraint spectrum quantified from variation in 141,456 humans

被引:6799
作者
Karczewski, Konrad J. [1 ,2 ]
Francioli, Laurent C. [1 ,2 ]
Tiao, Grace [1 ,2 ]
Cummings, Beryl B. [1 ,2 ,3 ]
Alfoldi, Jessica [1 ,2 ]
Wang, Qingbo [1 ,2 ,4 ]
Collins, Ryan L. [1 ,4 ,5 ]
Laricchia, Kristen M. [1 ,2 ]
Ganna, Andrea [1 ,2 ,6 ]
Birnbaum, Daniel P. [1 ,2 ]
Gauthier, Laura D. [7 ]
Brand, Harrison [1 ,5 ]
Solomonson, Matthew [1 ,2 ]
Watts, Nicholas A. [1 ,2 ]
Rhodes, Daniel [8 ,9 ]
Singer-Berk, Moriel [1 ,2 ]
England, Eleina M. [1 ,2 ]
Seaby, Eleanor G. [1 ,2 ]
Kosmicki, Jack A. [1 ,2 ,4 ]
Walters, Raymond K. [1 ,2 ,10 ]
Tashman, Katherine [1 ,2 ,10 ]
Farjoun, Yossi [7 ]
Banks, Eric [7 ]
Poterba, Timothy [1 ,2 ,10 ]
Wang, Arcturus [1 ,2 ,10 ]
Seed, Cotton [1 ,2 ,10 ]
Whiffin, Nicola [1 ,2 ,11 ,12 ,13 ]
Chong, Jessica X. [14 ]
Samocha, Kaitlin E. [15 ]
Pierce-Hoffman, Emma [1 ,2 ]
Zappala, Zachary [1 ,2 ,16 ]
O'Donnell-Luria, Anne H. [1 ,2 ,17 ,18 ]
Minikel, Eric Vallabh [1 ]
Weisburd, Ben [7 ]
Lek, Monkol [19 ]
Ware, James S. [1 ,11 ,12 ,13 ]
Vittal, Christopher [2 ,10 ]
Armean, Irina M. [1 ,2 ]
Bergelson, Louis [7 ]
Cibulskis, Kristian [7 ]
Connolly, Kristen M. [20 ]
Covarrubias, Miguel [7 ]
Donnelly, Stacey [1 ]
Ferriera, Steven [20 ]
Gabriel, Stacey [20 ]
Gentry, Jeff [7 ]
Gupta, Namrata [1 ,20 ]
Jeandet, Thibault [7 ]
Kaplan, Diane [7 ]
Llanwarne, Christopher [7 ]
机构
[1] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[2] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[3] Harvard Med Sch, Program Biol & Biomed Sci, Boston, MA 02115 USA
[4] Harvard Med Sch, Program Bioinformat & Integrat Genom, Boston, MA 02115 USA
[5] Massachusetts Gen Hosp, Ctr Genom Med, Boston, MA 02114 USA
[6] Inst Mol Med Finland, Helsinki, Finland
[7] Broad Inst MIT & Harvard, Data Sci Platform, Cambridge, MA 02142 USA
[8] Queen Mary Univ, Barts & London Sch Med & Dent, William Harvey Res Inst, Ctr Translat Bioinformat, London, England
[9] Barts Hearth NHS Trust, London, England
[10] Broad Inst MIT & Harvard, Stanley Ctr Psychiat Res, Cambridge, MA 02142 USA
[11] Imperial Coll London, Natl Heart & Lung Inst, London, England
[12] Imperial Coll London, MRC London Inst Med Sci, London, England
[13] Royal Brompton & Harefierd Hosp NHS Trust, Cardiovasc Res Ctr, London, England
[14] Univ Washington, Dept Pediat, Seattle, WA 98195 USA
[15] Wellcome Sanger Inst, Wellcome Genome Campus, Cambridge, England
[16] Vertex Pharmaceut Inc, Boston, MA USA
[17] Boston Childrens Hosp, Div Genet & Genom, Boston, MA USA
[18] Harvard Med Sch, Dept Pediat, Boston, MA 02115 USA
[19] Yale Sch Med, Dept Genet, New Haven, CT USA
[20] Broad Inst MIT & Harvard, Broad Genom, Cambridge, MA 02142 USA
[21] Harvard Med Sch, Dept Neurol, Boston, MA 02115 USA
[22] Garvan Inst Med Res, Ctr Populat Genom, Sydney, NSW, Australia
[23] UNSW Sydney, Sydney, NSW, Australia
[24] Murdoch Childrens Res Inst, Ctr Populat Genom, Melbourne, Vic, Australia
基金
英国惠康基金; 英国医学研究理事会; 瑞士国家科学基金会;
关键词
DE-NOVO MUTATIONS; VARIANTS; BURDEN; MODEL;
D O I
10.1038/s41586-020-2308-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes(1). Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.
引用
收藏
页码:434 / +
页数:22
相关论文
共 51 条
[1]
Genetic effects on gene expression across human tissues [J].
Aguet, Francois ;
Brown, Andrew A. ;
Castel, Stephane E. ;
Davis, Joe R. ;
He, Yuan ;
Jo, Brian ;
Mohammadi, Pejman ;
Park, Yoson ;
Parsana, Princy ;
Segre, Ayellet V. ;
Strober, Benjamin J. ;
Zappala, Zachary ;
Cummings, Beryl B. ;
Gelfand, Ellen T. ;
Hadley, Kane ;
Huang, Katherine H. ;
Lek, Monkol ;
Li, Xiao ;
Nedzel, Jared L. ;
Nguyen, Duyen Y. ;
Noble, Michael S. ;
Sullivan, Timothy J. ;
Tukiainen, Taru ;
MacArthur, Daniel G. ;
Getz, Gad ;
Management, Nih Program ;
Addington, Anjene ;
Guan, Ping ;
Koester, Susan ;
Little, A. Roger ;
Lockhart, Nicole C. ;
Moore, Helen M. ;
Rao, Abhi ;
Struewing, Jeffery P. ;
Volpi, Simona ;
Collection, Biospecimen ;
Brigham, Lori E. ;
Hasz, Richard ;
Hunter, Marcus ;
Johns, Christopher ;
Johnson, Mark ;
Kopen, Gene ;
Leinweber, William F. ;
Lonsdale, John T. ;
McDonald, Alisa ;
Mestichelli, Bernadette ;
Myer, Kevin ;
Roe, Bryan ;
Salvatore, Michael ;
Shad, Saboor .
NATURE, 2017, 550 (7675) :204-+
[2]
[Anonymous], 2013, Curr. Protoc. Bioinform.
[3]
Mendelian Gene Discovery: Fast and Furious with No End in Sight [J].
Bamshad, Michael J. ;
Nickerson, Deborah A. ;
Chong, Jessica X. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2019, 105 (03) :448-455
[4]
Estimating the selective effects of heterozygous protein-truncating variants from human exome data [J].
Cassa, Christopher A. ;
Weghorn, Donate ;
Balick, Daniel J. ;
Jordan, Daniel M. ;
Nusinow, David ;
Samocha, Kaitlin E. ;
O'Donnell-Luria, Anne ;
MacArthur, Daniel G. ;
Daly, Mark J. ;
Beier, David R. ;
Sunyaev, Shamil R. .
NATURE GENETICS, 2017, 49 (05) :806-+
[5]
The Genetic Basis of Mendelian Phenotypes: Discoveries, Challenges, and Opportunities [J].
Chong, Jessica X. ;
Buckingham, Kati J. ;
Jhangiani, Shalini N. ;
Boehm, Corinne ;
Sobreira, Nara ;
Smith, Joshua D. ;
Harrell, Tanya M. ;
McMillin, Margaret J. ;
Wiszniewski, Wojciech ;
Gambin, Tomasz ;
Akdemir, Zeynep H. Coban ;
Doheny, Kimberly ;
Scott, Alan F. ;
Avramopoulos, Dimitri ;
Chakravarti, Aravinda ;
Hoover-Fong, Julie ;
Mathews, Debra ;
Witmer, P. Dane ;
Ling, Hua ;
Hetrick, Kurt ;
Watkins, Lee ;
Patterson, Karynne E. ;
Reinier, Frederic ;
Blue, Elizabeth ;
Muzny, Donna ;
Kircher, Martin ;
Bilguvar, Kaya ;
Lopez-Giraldez, Francesc ;
Sutton, V. Reid ;
Tabor, Holly K. ;
Lea, Suzanne M. ;
Gune, Murat ;
Mane, Shrikant ;
Gibbs, Richard A. ;
Boerwinkle, Eric ;
Hamosh, Ada ;
Shendure, Jay ;
Lupski, James R. ;
Lifton, Richard P. ;
Valle, David ;
Nickerson, Deborah A. ;
Bamshad, Michael J. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2015, 97 (02) :199-215
[6]
Sequence variations in PCSK9, low LDL, and protection against coronary heart disease [J].
Cohen, JC ;
Boerwinkle, E ;
Mosley, TH ;
Hobbs, HH .
NEW ENGLAND JOURNAL OF MEDICINE, 2006, 354 (12) :1264-1272
[7]
A structural variation reference for medical and population genetics [J].
Collins, Ryan L. ;
Brand, Harrison ;
Karczewski, Konrad J. ;
Zhao, Xuefang ;
Alfoldi, Jessica ;
Francioli, Laurent C. ;
Khera, Amit, V ;
Lowther, Chelsea ;
Gauthier, Laura D. ;
Wang, Harold ;
Watts, Nicholas A. ;
Solomonson, Matthew ;
O'Donnell-Luria, Anne ;
Baumann, Alexander ;
Munshi, Ruchi ;
Walker, Mark ;
Whelan, Christopher W. ;
Huang, Yongqing ;
Brookings, Ted ;
Sharpe, Ted ;
Stone, Matthew R. ;
Valkanas, Elise ;
Fu, Jack ;
Tiao, Grace ;
Laricchia, Kristen M. ;
Ruano-Rubio, Valentin ;
Stevens, Christine ;
Gupta, Namrata ;
Cusick, Caroline ;
Margolin, Lauren ;
Taylor, Kent D. ;
Lin, Henry J. ;
Rich, Stephen S. ;
Post, Wendy S. ;
Chen, Yii-Der Ida ;
Rotter, Jerome, I ;
Nusbaum, Chad ;
Philippakis, Anthony ;
Lander, Eric ;
Gabriel, Stacey ;
Neale, Benjamin M. ;
Kathiresan, Sekar ;
Daly, Mark J. ;
Banks, Eric ;
MacArthur, Daniel G. ;
Talkowski, Michael E. .
NATURE, 2020, 581 (7809) :444-+
[8]
Transcript expression-aware annotation improves rare variant interpretation [J].
Cummings, Beryl B. ;
Karczewski, Konrad J. ;
Kosmicki, Jack A. ;
Seaby, Eleanor G. ;
Watts, Nicholas A. ;
Singer-Berk, Moriel ;
Mudge, Jonathan M. ;
Karjalainen, Juha ;
Satterstrom, F. Kyle ;
O'Donnell-Luria, Anne H. ;
Poterba, Timothy ;
Seed, Cotton ;
Solomonson, Matthew ;
Alfoldi, Jessica ;
Daly, Mark J. ;
MacArthur, Daniel G. .
NATURE, 2020, 581 (7809) :452-+
[9]
Variable population prevalence estimates of germline TP53 variants: A gnomAD-based analysis [J].
de Andrade, Kelvin C. ;
Frone, Megan N. ;
Wegman-Ostrosky, Talia ;
Khincha, Payal P. ;
Kim, Jung ;
Amadou, Amina ;
Santiago, Karina M. ;
Fortes, Fernanda P. ;
Lemonnier, Nathanal ;
Mirabello, Lisa ;
Stewart, Douglas R. ;
Hainaut, Pierre ;
Kowalski, Luiz P. ;
Savage, Sharon A. ;
Achatz, Maria I. .
HUMAN MUTATION, 2019, 40 (01) :97-105
[10]
Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study [J].
DeBoever, Christopher ;
Tanigawa, Yosuke ;
Lindholm, Malene E. ;
McInnes, Greg ;
Lavertu, Adam ;
Ingelsson, Erik ;
Chang, Chris ;
Ashley, Euan A. ;
Bustamante, Carlos D. ;
Daly, Mark J. ;
Rivas, Manuel A. .
NATURE COMMUNICATIONS, 2018, 9