Improved imputation quality of low-frequency and rare variants in European samples using the 'Genome of The Netherlands'

被引:72
作者
Deelen, Patrick [1 ,2 ]
Menelaou, Androniki [3 ]
van Leeuwen, Elisabeth M. [4 ]
Kanterakis, Alexandros [1 ,2 ]
van Dijk, Freerk [1 ,2 ]
Medina-Gomez, Carolina [6 ,7 ]
Francioli, Laurent C. [3 ]
Hottenga, Jouke Jan [8 ]
Karssen, Lennart C. [4 ]
Estrada, Karol [6 ,9 ,10 ]
Kreiner-Moller, Eskil [5 ,6 ,11 ,12 ,13 ]
Rivadeneira, Fernando [5 ,6 ,7 ]
van Setten, Jessica [3 ]
Gutierrez-Achury, Javier [1 ]
Westra, Harm-Jan [1 ]
Franke, Lude [1 ]
van Enckevort, David [2 ,14 ]
Dijkstra, Martijn [1 ,2 ]
Byelas, Heorhiy [1 ,2 ]
van Duijn, Cornelia M. [6 ]
de Bakker, Paul I. W. [3 ,15 ,16 ,17 ]
Wijmenga, Cisca [1 ]
Swertz, Morris A. [1 ,2 ]
机构
[1] Univ Groningen, Univ Med Ctr Groningen, Dept Genet, NL-9700 RB Groningen, Netherlands
[2] Univ Groningen, Univ Med Ctr Groningen, Genom Coordinat Ctr, NL-9700 RB Groningen, Netherlands
[3] Univ Med Ctr Utrecht, Dept Med Genet, NL-9700 RB Utrecht, Netherlands
[4] Erasmus MC, Dept Epidemiol, Genet Epidemiol Unit, Rotterdam, Netherlands
[5] Erasmus MC, Dept Internal Med, Rotterdam, Netherlands
[6] Erasmus Univ, Med Ctr, Dept Epidemiol, Rotterdam, Netherlands
[7] Netherlands Consortium Hlth Aging, Netherlands Genom Initiat, Rotterdam, Netherlands
[8] Vrije Univ Amsterdam, Dept Biol Psychol, Amsterdam, Netherlands
[9] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Dept Med, Boston, MA 02114 USA
[10] Harvard Univ, Massachusetts Gen Hosp, Sch Med, Boston, MA USA
[11] COPSAC, Copenhagen, Denmark
[12] Copenhagen Prospect Studies Asthma Childhood, Copenhagen, Denmark
[13] Univ Copenhagen, Fac Hlth Sci, Copenhagen, Denmark
[14] Netherlands Bioinformat Ctr, NBIC BioAssist, Nijmegen, Netherlands
[15] Univ Med Ctr Utrecht, Dept Epidemiol, Utrecht, Netherlands
[16] Harvard Univ, Brigham & Womens Hosp, Sch Med, Div Genet, Boston, MA 02115 USA
[17] Broad Inst Harvard & MIT, Cambridge, MA USA
关键词
genotype imputation; GWAS; GoNL; rare variants; reference sets; reference panel; ASSOCIATION; COMMON; DISEASE; ARRAY; POWER; LOCI;
D O I
10.1038/ejhg.2014.19
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with 'true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05-0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r(2), increased from 0.61 to 0.71. We also saw improved imputation accuracy for other European populations (in the British samples, r(2) improved from 0.58 to 0.65, and in the Italians from 0.43 to 0.47). A combined reference set comprising 1000G and GoNL improved the imputation of rare variants even further. The Italian samples benefitted the most from this combined reference (the mean r(2) increased from 0.47 to 0.50). We conclude that the creation of a large population-specific reference is advantageous for imputing rare variants and that a combined reference panel across multiple populations yields the best imputation results.
引用
收藏
页码:1321 / 1326
页数:6
相关论文
共 26 条
[1]  
[Anonymous], 2018, R software: Version 3.5.1, DOI [DOI 10.1007/978-3-540-74686-7, 10.1007/978-3-540-74686-7]
[2]   The Genome of the Netherlands: design, and project goals [J].
Boomsma, Dorret I. ;
Wijmenga, Cisca ;
Slagboom, Eline P. ;
Swertz, Morris A. ;
Karssen, Lennart C. ;
Abdellaoui, Abdel ;
Ye, Kai ;
Guryev, Victor ;
Vermaat, Martijn ;
van Dijk, Freerk ;
Francioli, Laurent C. ;
Hottenga, Jouke Jan ;
Laros, Jeroen F. J. ;
Li, Qibin ;
Li, Yingrui ;
Cao, Hongzhi ;
Chen, Ruoyan ;
Du, Yuanping ;
Li, Ning ;
Cao, Sujie ;
van Setten, Jessica ;
Menelaou, Androniki ;
Pulit, Sara L. ;
Hehir-Kwa, Jayne Y. ;
Beekman, Marian ;
Elbers, Clara C. ;
Byelas, Heorhiy ;
de Craen, Anton J. M. ;
Deelen, Patrick ;
Dijkstra, Martijn ;
den Dunnen, Johan T. ;
de Knijff, Peter ;
Houwing-Duistermaat, Jeanine ;
Koval, Vyacheslav ;
Estrada, Karol ;
Hofman, Albert ;
Kanterakis, Alexandros ;
van Enckevort, David ;
Mai, Hailiang ;
Kattenberg, Mathijs ;
van Leeuwen, Elisabeth M. ;
Neerincx, Pieter B. T. ;
Oostra, Ben ;
Rivadeneira, Fernanodo ;
Suchiman, Eka H. D. ;
Uitterlinden, Andre G. ;
Willemsen, Gonneke ;
Wolffenbuttel, Bruce H. ;
Wang, Jun ;
de Bakker, Paul I. W. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2014, 22 (02) :221-227
[3]  
Byelas H, P 5 INT WORKSH SCI G
[4]   Uncovering the roles of rare variants in common disease through whole-genome sequencing [J].
Cirulli, Elizabeth T. ;
Goldstein, David B. .
NATURE REVIEWS GENETICS, 2010, 11 (06) :415-425
[5]   Promise and pitfalls of the Immunochip [J].
Cortes, Adrian ;
Brown, Matthew A. .
ARTHRITIS RESEARCH & THERAPY, 2011, 13 (01)
[6]   Efficiency and power in genetic association studies [J].
de Bakker, PIW ;
Yelensky, R ;
Pe'er, I ;
Gabriel, SB ;
Daly, MJ ;
Altshuler, D .
NATURE GENETICS, 2005, 37 (11) :1217-1223
[7]   Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation [J].
Flannick, Jason ;
Korn, Joshua M. ;
Fontanillas, Pierre ;
Grant, George B. ;
Banks, Eric ;
Depristo, Mark A. ;
Altshuler, David .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (07)
[8]   Accuracy of genome-wide imputation of untyped markers and impacts on statistical power for association studies [J].
Hao, Ke ;
Chudin, Eugene ;
McElwee, Joshua ;
Schadt, Eric E. .
BMC GENETICS, 2009, 10
[9]   Potential etiologic and functional implications of genome-wide association loci for human diseases and traits [J].
Hindorff, Lucia A. ;
Sethupathy, Praveen ;
Junkins, Heather A. ;
Ramos, Erin M. ;
Mehta, Jayashri P. ;
Collins, Francis S. ;
Manolio, Teri A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (23) :9362-9367
[10]   A rare variant in MYH6 is associated with high risk of sick sinus syndrome [J].
Holm, Hilma ;
Gudbjartsson, Daniel F. ;
Sulem, Patrick ;
Masson, Gisli ;
Helgadottir, Hafdis Th ;
Zanon, Carlo ;
Magnusson, Olafur Th ;
Helgason, Agnar ;
Saemundsdottir, Jona ;
Gylfason, Arnaldur ;
Stefansdottir, Hrafnhildur ;
Gretarsdottir, Solveig ;
Matthiasson, Stefan E. ;
Thorgeirsson, Guomundur ;
Jonasdottir, Aslaug ;
Sigurdsson, Asgeir ;
Stefansson, Hreinn ;
Werge, Thomas ;
Rafnar, Thorunn ;
Kiemeney, Lambertus A. ;
Parvez, Babar ;
Muhammad, Raafia ;
Roden, Dan M. ;
Darbar, Dawood ;
Thorleifsson, Gudmar ;
Walters, G. Bragi ;
Kong, Augustine ;
Thorsteinsdottir, Unnur ;
Arnar, David O. ;
Stefansson, Kari .
NATURE GENETICS, 2011, 43 (04) :316-U148