Interpretation of Association Signals and Identification of Causal Variants from Genome-wide Association Studies

被引:112
作者
Wang, Kai [1 ]
Dickson, Samuel P. [2 ]
Stolle, Catherine A. [3 ]
Krantz, Ian D. [4 ,5 ]
Goldstein, David B. [2 ]
Hakonarson, Hakon [1 ,4 ,5 ]
机构
[1] Childrens Hosp Philadelphia, Ctr Appl Genom, Philadelphia, PA 19104 USA
[2] Duke Univ, Ctr Human Genome Variat, Inst Genome Sci & Policy, Durham, NC 27708 USA
[3] Childrens Hosp Philadelphia, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
[4] Childrens Hosp Philadelphia, Div Human Genet, Philadelphia, PA 19104 USA
[5] Univ Penn, Dept Pediat, Philadelphia, PA 19104 USA
基金
英国惠康基金; 美国国家卫生研究院;
关键词
LINKAGE-DISEQUILIBRIUM PATTERNS; COMMON DISEASE; RARE VARIANTS; CROHNS-DISEASE; MISSING HERITABILITY; COMPLEX DISEASES; MUTATIONS; GENETICS; SUSCEPTIBILITY; CONTRIBUTE;
D O I
10.1016/j.ajhg.2010.04.003
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
GWAS have been successful in identifying disease susceptibility loci, but it remains a challenge to pinpoint the causal variants in subsequent fine-mapping studies. A conventional fine-mapping effort starts by sequencing dozens of randomly selected samples at susceptibility loci to discover candidate variants, which are then placed on custom arrays or used in imputation algorithms to find the causal variants. We propose that one or several rare or low-frequency causal variants can hitchhike the same common tag SNP, so causal variants may not be easily unveiled by conventional efforts. Here, we first demonstrate that the true effect size and proportion of variance explained by a collection of rare causal variants can be underestimated by a common tag SNP, thereby accounting for some of the "missing heritability" in GWAS. We then describe a case-selection approach based on phasing long-range haplotypes and sequencing cases predicted to harbor causal variants. We compare this approach with conventional strategies on a simulated data set, and we demonstrate its advantages when multiple causal variants are present. We also evaluate this approach in a GWAS on hearing loss, where the most common causal variant has a minor allele frequency (MAF) of 1.3% in the general population and 8.2% in 329 cases. With our case-selection approach, it is present in 88% of the 32 selected cases (MAF = 66%), so sequencing a subset of these cases can readily reveal the causal allele. Our results suggest that thinking beyond common variants is essential in interpreting GWAS signals and identifying causal variants.
引用
收藏
页码:730 / 742
页数:13
相关论文
共 59 条
  • [1] Genetic Mapping in Human Disease
    Altshuler, David
    Daly, Mark J.
    Lander, Eric S.
    [J]. SCIENCE, 2008, 322 (5903) : 881 - 888
  • [2] Genome-wide association study identifies three loci associated with melanoma risk
    Bishop, D. Timothy
    Demenais, Florence
    Iles, Mark M.
    Harland, Mark
    Taylor, John C.
    Corda, Eve
    Randerson-Moor, Juliette
    Aitken, Joanne F.
    Avril, Marie-Francoise
    Azizi, Esther
    Bakker, Bert
    Bianchi-Scarra, Giovanna
    Bressac-de Paillerets, Brigitte
    Calista, Donato
    Cannon-Albright, Lisa A.
    Chin-A-Woeng, Thomas
    Debniak, Tadeusz
    Galore-Haskel, Gilli
    Ghiorzo, Paola
    Gut, Ivo
    Hansson, Johan
    Hocevar, Marko
    Hoiom, Veronica
    Hopper, John L.
    Ingvar, Christian
    Kanetsky, Peter A.
    Kefford, Richard F.
    Landi, Maria Teresa
    Lang, Julie
    Lubinski, Jan
    Mackie, Rona
    Malvehy, Josep
    Mann, Graham J.
    Martin, Nicholas G.
    Montgomery, Grant W.
    van Nieuwpoort, Frans A.
    Novakovic, Srdjan
    Olsson, Hakan
    Puig, Susana
    Weiss, Marjan
    van Workum, Wilbert
    Zelenika, Diana
    Brown, Kevin M.
    Goldstein, Alisa M.
    Gillanders, Elizabeth M.
    Boland, Anne
    Galan, Pilar
    Elder, David E.
    Gruis, Nelleke A.
    Hayward, Nicholas K.
    [J]. NATURE GENETICS, 2009, 41 (08) : 920 - U85
  • [3] Common and rare variants in multifactorial susceptibility to common diseases
    Bodmer, Walter
    Bonilla, Carolina
    [J]. NATURE GENETICS, 2008, 40 (06) : 695 - 701
  • [4] Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    Burton, Paul R.
    Clayton, David G.
    Cardon, Lon R.
    Craddock, Nick
    Deloukas, Panos
    Duncanson, Audrey
    Kwiatkowski, Dominic P.
    McCarthy, Mark I.
    Ouwehand, Willem H.
    Samani, Nilesh J.
    Todd, John A.
    Donnelly, Peter
    Barrett, Jeffrey C.
    Davison, Dan
    Easton, Doug
    Evans, David
    Leung, Hin-Tak
    Marchini, Jonathan L.
    Morris, Andrew P.
    Spencer, Chris C. A.
    Tobin, Martin D.
    Attwood, Antony P.
    Boorman, James P.
    Cant, Barbara
    Everson, Ursula
    Hussey, Judith M.
    Jolley, Jennifer D.
    Knight, Alexandra S.
    Koch, Kerstin
    Meech, Elizabeth
    Nutland, Sarah
    Prowse, Christopher V.
    Stevens, Helen E.
    Taylor, Niall C.
    Walters, Graham R.
    Walker, Neil M.
    Watkins, Nicholas A.
    Winzer, Thilo
    Jones, Richard W.
    McArdle, Wendy L.
    Ring, Susan M.
    Strachan, David P.
    Pembrey, Marcus
    Breen, Gerome
    St Clair, David
    Caesar, Sian
    Gordon-Smith, Katherine
    Jones, Lisa
    Fraser, Christine
    Green, Elain K.
    [J]. NATURE, 2007, 447 (7145) : 661 - 678
  • [5] Population genetics - making sense out of sequence
    Chakravarti, A
    [J]. NATURE GENETICS, 1999, 21 (Suppl 1) : 56 - 60
  • [6] The genetics of inflammatory bowel disease
    Cho, Judy H.
    Weaver, Casey T.
    [J]. GASTROENTEROLOGY, 2007, 133 (04) : 1327 - 1339
  • [7] Sequence variations in PCSK9, low LDL, and protection against coronary heart disease
    Cohen, JC
    Boerwinkle, E
    Mosley, TH
    Hobbs, HH
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2006, 354 (12) : 1264 - 1272
  • [8] Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels
    Cohen, JC
    Pertsemlidis, A
    Fahmi, S
    Esmail, S
    Vega, GL
    Grundy, SM
    Hobbs, HH
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (06) : 1810 - 1815
  • [9] Multiple rare Alleles contribute to low plasma levels of HDL cholesterol
    Cohen, JC
    Kiss, RS
    Pertsemlidis, A
    Marcel, YL
    McPherson, R
    Hobbs, HH
    [J]. SCIENCE, 2004, 305 (5685) : 869 - 872
  • [10] A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHC
    de Bakker, Paul I. W.
    McVean, Gil
    Sabeti, Pardis C.
    Miretti, Marcos M.
    Green, Todd
    Marchini, Jonathan
    Ke, Xiayi
    Monsuur, Alienke J.
    Whittaker, Pamela
    Delgado, Marcos
    Morrison, Jonathan
    Richardson, Angela
    Walsh, Emily C.
    Gao, Xiaojiang
    Galver, Luana
    Hart, John
    Hafler, David A.
    Pericak-Vance, Margaret
    Todd, John A.
    Daly, Mark J.
    Trowsdale, John
    Wijmenga, Cisca
    Vyse, Tim J.
    Beck, Stephan
    Murray, Sarah Shaw
    Carrington, Mary
    Gregory, Simon
    Deloukas, Panos
    Rioux, John D.
    [J]. NATURE GENETICS, 2006, 38 (10) : 1166 - 1172