A Comparison of Strategies for Analyzing Dichotomous Outcomes in Genome-Wide Association Studies With General Pedigrees

被引:14
作者
Chen, Ming-Huei [2 ,3 ]
Liu, Xuan
Wei, Fengrong [4 ]
Larson, Martin G. [3 ,5 ]
Fox, Caroline S. [3 ,6 ,7 ]
Vasan, Ramachandran S. [3 ]
Yang, Qiong [1 ,3 ]
机构
[1] Boston Univ, Sch Publ Hlth, Dept Biostat, Crosstown Ctr, Boston, MA 02118 USA
[2] Boston Univ, Sch Med, Dept Neurol, Boston, MA 02118 USA
[3] NHLBIs Framingham Heart Study, Framingham, MA USA
[4] Univ W Georgia, Dept Math, Carrollton, GA USA
[5] Boston Univ, Dept Math & Stat, Boston, MA 02118 USA
[6] Harvard Univ, Sch Med, Boston, MA USA
[7] Brigham & Womens Hosp, Div Endocrinol Hypertens & Diabet, Boston, MA 02115 USA
关键词
genetic association; dichotomous phenotypes; familial relatedness; LINEAR MIXED MODELS; ESTIMATING EQUATIONS; VARIANCE; ACCOUNT;
D O I
10.1002/gepi.20614
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Genome-wide association studies (GWAS) have been frequently conducted on general or isolated populations with related individuals. However, there is a lack of consensus on which strategy is most appropriate for analyzing dichotomous phenotypes in general pedigrees. Using simulation studies, we compared several strategies including generalized estimating equations (GEE) strategies with various working correlation structures, generalized linear mixed model (GLMM), and a variance component strategy (denoted LMEBIN) that treats dichotomous outcomes as continuous with special attentions to their performance with rare variants, rare diseases, and small sample sizes. In our simulations, when the sample size is not small, for type I error, only GEE and LMEBIN maintain nominal type I error in most cases with exceptions for GEE with very rare disease and genetic variants. GEE and LMEBIN have similar statistical power and slightly outperform GLMM when the prevalence is low. In terms of computational efficiency, GEE with sandwich variance estimator outperforms GLMM and LMEBIN. We apply the strategies to GWAS of gout in the Framingham Heart Study. Based on our results, we would recommend using GEE ind-san in the GWAS for common variants and GEE ind-fij or LMEBIN for rare variants for GWAS of dichotomous outcomes with general pedigrees. Genet. Epidemiol. 35:650-657, 2011. (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:650 / 657
页数:8
相关论文
共 18 条
[1]
Multipoint quantitative-trait linkage analysis in general pedigrees [J].
Almasy, L ;
Blangero, J .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 62 (05) :1198-1211
[2]
ATKINSON B., 2009, KINSHIP MIXED EFFECT
[3]
Linear mixed models and penalized least squares [J].
Bates, DM ;
DebRoy, S .
JOURNAL OF MULTIVARIATE ANALYSIS, 2004, 91 (01) :1-17
[4]
APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[5]
Case-control single-marker and haplotypic association analysis of pedigree data [J].
Browning, SR ;
Briley, JD ;
Briley, LP ;
Chandra, G ;
Charnecki, JH ;
Ehm, MG ;
Johansson, KA ;
Jones, BJ ;
Karter, AJ ;
Yarnall, DP ;
Wagner, MJ .
GENETIC EPIDEMIOLOGY, 2005, 28 (02) :110-122
[6]
GWAF: an R package for genome-wide association analyses with family data [J].
Chen, Ming-Huei ;
Yang, Qiong .
BIOINFORMATICS, 2010, 26 (04) :580-581
[7]
WALDS TEST AS APPLIED TO HYPOTHESES IN LOGIT ANALYSIS [J].
HAUCK, WW ;
DONNER, A .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1977, 72 (360) :851-853
[8]
Variance component model to account for sample structure in genome-wide association studies [J].
Kang, Hyun Min ;
Sul, Jae Hoon ;
Service, Susan K. ;
Zaitlen, Noah A. ;
Kong, Sit-yee ;
Freimer, Nelson B. ;
Sabatti, Chiara ;
Eskin, Eleazar .
NATURE GENETICS, 2010, 42 (04) :348-U110
[9]
A note on the efficiency of sandwich covariance matrix estimation [J].
Kauermann, G ;
Carroll, RJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1387-1396
[10]
LIANG KY, 1986, BIOMETRIKA, V73, P13, DOI 10.1093/biomet/73.1.13