Bayesian Variable and Model Selection Methods for Genetic Association Studies

被引:33
作者
Fridley, Brooke L. [1 ]
机构
[1] Mayo Clin, Div Biostat, Coll Med, Dept Hlth Sci Res, Rochester, MN 55905 USA
关键词
Bayesian model averaging; Bayesian variable selection; candidate gene; Markov chain Monte Carlo; reversible jump MCMC; stochastic search variable selection; COMPLEMENT FACTOR-H; GENOME-WIDE ASSOCIATION; APOLIPOPROTEIN-E; HAPLOTYPE RECONSTRUCTION; MACULAR DEGENERATION; STRANDED-RNA; SUSCEPTIBILITY; POLYMORPHISM; RECOGNITION; DISEASE;
D O I
10.1002/gepi.20353
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Variable selection is growing in importance with the advent of high throughput genotyping methods requiring analysis of hundreds to thousands of single nucleotide polymorphisms (SNPs) and the increased interest in using these genetic studies to better understand common, complex diseases. Up to now, the standard approach has been to analyze the genotypes for each SNP individually to look for an association with a disease. Alternatively, combinations of SNPs or haplotypes are analyzed for association. Another added complication in studying complex diseases or phenotypes is that genetic risk for the disease is often due to multiple SNPs in various locations on the chromosome with small individual effects that may have a collectively large effect on the phenotype. Hence, multi-locus SNP models, as opposed to single SNP models, may better capture the true underlying genotypic-phenotypic relationship. Thus, innovative methods for determining which SNPs to include in the model are needed. The goal of this article is to describe several methods currently available for variable and model selection using Bayesian approaches and to illustrate their application for genetic association studies using both real and simulated candidate gene data for a complex disease. In particular, Bayesian model averaging (BMA), stochastic search variable selection (SSVS), and Bayesian variable selection (BVS) using a reversible jump Markov chain Monte Carlo (MCMC) for candidate gene association studies are illustrated using a Study of age-related macular degeneration (AMD) and simulated data. Genet. Epidemiol. 33:27-37, 2009. (C) 2008 Wiley-Liss, Inc.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 81 条
[1]  
Akaike H, 1992, Breakthroughs in statistics, VI, P610, DOI DOI 10.1007/978-1-4612-1694-0_15
[2]   BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].
ALBERT, JH ;
CHIB, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :669-679
[3]   Recognition of double-stranded RNA and activation of NF-κB by Toll-like receptor 3 [J].
Alexopoulou, L ;
Holt, AC ;
Medzhitov, R ;
Flavell, RA .
NATURE, 2001, 413 (6857) :732-738
[4]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[5]  
Ball RD, 2001, GENETICS, V159, P1351
[6]  
BAYES T, 1763, PHILOS T ROY SOC LON, P330
[7]   The Bayesian revolution in genetics [J].
Beaumont, MA ;
Rannala, B .
NATURE REVIEWS GENETICS, 2004, 5 (04) :251-261
[8]   Bayesian analysis: A look at today and thoughts of tomorrow [J].
Berger, JO .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (452) :1269-1276
[9]   Quantitative trait nucleotide analysis using Bayesian model selection [J].
Blangero, J ;
Göring, HHH ;
Kent, JW ;
Williams, JT ;
Peterson, CP ;
Almasy, L ;
Dyer, TD .
HUMAN BIOLOGY, 2005, 77 (05) :541-559
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32