PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations

Cited by: 753
Authors
Denny, Joshua C. [1, 2]
Ritchie, Marylyn D. [3]
Basford, Melissa A. [1]
Pulley, Jill M. [1, 2]
Bastarache, Lisa [1]
Brown-Gentry, Kristin [3]
Wang, Deede [2]
Masys, Dan R. [1]
Roden, Dan M. [2]
Crawford, Dana C. [3]
Affiliations
[1] Vanderbilt Univ, Dept Biomed Informat, Nashville, TN 37203 USA
[2] Vanderbilt Univ, Dept Med, Nashville, TN USA
[3] Vanderbilt Univ, Dept Mol Physiol & Biophys, Ctr Human Genet Res, Sch Med, Nashville, TN 37232 USA
Funding
US National Institutes of Health;
Keywords
RISK; DIAGNOSIS; CHILDREN;
DOI
10.1093/bioinformatics/btq126
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Motivation: Emergence of genetic data coupled to longitudinal electronic medical records (EMRs) offers the possibility of phenome-wide association scans (PheWAS) for disease-gene associations. We propose a novel method to scan phenomic data for genetic associations using International Classification of Disease (ICD9) billing codes, which are available in most EMR systems. We have developed a code translation table to automatically define 776 different disease populations and their controls using prevalent ICD9 codes derived from EMR data. As a proof of concept of this algorithm, we genotyped the first 6005 European-Americans accrued into BioVU, Vanderbilt's DNA biobank, at five single nucleotide polymorphisms (SNPs) with previously reported disease associations: atrial fibrillation, Crohn's disease, carotid artery stenosis, coronary artery disease, multiple sclerosis, systemic lupus erythematosus and rheumatoid arthritis. The PheWAS software generated case and control populations across all ICD9 code groups for each of these five SNPs, and disease-SNP associations were analyzed. The primary outcome of this study was replication of seven previously known SNP-disease associations for these SNPs.

Results: Four of seven known SNP-disease associations were replicated using the PheWAS algorithm, with P-values between 2.8 × 10⁻⁶ and 0.011. The PheWAS algorithm also identified 19 previously unknown statistical associations between these SNPs and diseases at P < 0.01. This study indicates that PheWAS analysis is a feasible method to investigate SNP-disease associations. Further evaluation is needed to determine the validity of these associations and the appropriate statistical thresholds for clinical significance.
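The workflow the abstract describes (translate billing codes into disease groups, derive cases and controls per group, then test each group against a SNP) can be sketched roughly as follows. This is a minimal illustration and not the authors' PheWAS software. The three-entry `CODE_GROUPS` table, the prefix-matching rule, the carrier versus non-carrier comparison, the choice of an uncorrected Pearson chi-square test, and all function names are assumptions made for the example. The record above only states that a translation table defining 776 disease populations was used.

```python
from collections import defaultdict
from math import erfc, sqrt

# Hypothetical stand-in for the code translation table (ICD9 prefix -> disease group).
CODE_GROUPS = {
    "427.3": "atrial fibrillation",
    "555": "Crohn's disease",
    "714": "rheumatoid arthritis",
}


def phenotype_for(icd9_code):
    """Map an ICD9 code to a disease group by longest-prefix match, or return None."""
    for length in range(len(icd9_code), 0, -1):
        group = CODE_GROUPS.get(icd9_code[:length])
        if group is not None:
            return group
    return None


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for the table [[a, b], [c, d]].

    Returns (statistic, p_value). For 1 df the survival function is erfc(sqrt(x / 2)).
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0, 1.0  # degenerate table: no evidence of association
    stat = n * (a * d - b * c) ** 2 / denom
    return stat, erfc(sqrt(stat / 2))


def phewas_scan(patient_codes, carriers):
    """Scan every disease group for association with one SNP.

    patient_codes: {patient_id: set of ICD9 codes}
    carriers: set of patient ids carrying the allele of interest (assumed coding)
    Returns {group: (n_cases, n_controls, p_value)}.
    """
    cases_by_group = defaultdict(set)
    for pid, codes in patient_codes.items():
        for code in codes:
            group = phenotype_for(code)
            if group is not None:
                cases_by_group[group].add(pid)

    everyone = set(patient_codes)
    results = {}
    for group, cases in cases_by_group.items():
        # Simplification: controls are all patients lacking any code in the group.
        controls = everyone - cases
        _, p = chi_square_2x2(
            len(cases & carriers), len(cases - carriers),
            len(controls & carriers), len(controls - carriers),
        )
        results[group] = (len(cases), len(controls), p)
    return results
```

Calling `phewas_scan` once per SNP and filtering on the returned P-values would mirror the scan described in the Results paragraph, subject to the simplifications noted in the comments.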
Pages: 1205-1210
Number of pages: 6