Regression-based association analysis with clustered haplotypes through use of genotypes

被引:65
作者
Tzeng, JY
Wang, CH
Kao, JT
Hsiao, CK
机构
[1] N Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
[2] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[3] Cardinal Tien Hosp, Dept Cardiol, Taipei, Taiwan
[4] Fu Jen Catholic Univ, Dept Med, Coll Med, Taipei, Taiwan
[5] Natl Taiwan Univ, Coll Med, Dept Clin Lab Sci & Med Biotechnol, Taipei 10018, Taiwan
[6] Natl Taiwan Univ, Inst Epidemiol, Div Biostat, Taipei 10764, Taiwan
关键词
D O I
10.1086/500025
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Haplotype-based association analysis has been recognized as a tool with high resolution and potentially great power for identifying modest etiological effects of genes. However, in practice, its efficacy has not been as successfully reproduced as expected in theory. One primary cause is that such analysis tends to require a large number of parameters to capture the abundant haplotype varieties, and many of those are expended on rare haplotypes for which studies would have insufficient power to detect association even if it existed. To concentrate statistical power on more-relevant inferences, in this study, we developed a regression-based approach using clustered haplotypes to assess haplotype-phenotype association. Specifically, we generalized the probabilistic clustering methods of Tzeng to the generalized linear model ( GLM) framework established by Schaid et al. The proposed method uses unphased genotypes and incorporates both phase uncertainty and clustering uncertainty. Its GLM framework allows adjustment of covariates and can model qualitative and quantitative traits. It can also evaluate the overall haplotype association or the individual haplotype effects. We applied the proposed approach to study the association between hypertriglyceridemia and the apolipoprotein A5 gene. Through simulation studies, we assessed the performance of the proposed approach and demonstrate its validity and power in testing for haplotype-trait association.
引用
收藏
页码:231 / 242
页数:12
相关论文
共 56 条
[1]   Hypertriglyceridemia and elevated lipoprotein(a) are risk factors for major coronary events in middle-aged men [J].
Assmann, G ;
Schulte, H ;
vonEckardstein, A .
AMERICAN JOURNAL OF CARDIOLOGY, 1996, 77 (14) :1179-1184
[2]   ON GENERALIZED SCORE TESTS [J].
BOOS, DD .
AMERICAN STATISTICIAN, 1992, 46 (04) :327-333
[3]   Missing data in haplotype analysis: a study on the MILC method [J].
Bourgain, C ;
Genin, E ;
Ober, C ;
Clerget-Darpoux, F .
ANNALS OF HUMAN GENETICS, 2002, 66 :99-108
[4]   Use of closely related affected individuals for the genetic study of complex diseases in founder populations [J].
Bourgain, C ;
Génin, E ;
Holopainen, P ;
Mustalahti, K ;
Mäki, M ;
Partanen, J ;
Clerget-Darpoux, F .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (01) :154-159
[5]   Search for multifactorial disease susceptibility genes in founder populations [J].
Bourgain, C ;
Genin, E ;
Quesneville, H ;
Clerget-Darpoux, F .
ANNALS OF HUMAN GENETICS, 2000, 64 :255-265
[6]   Detecting disease associations due to linkage disequilibrium using haplotype tags: A class of tests and the determinants of statistical power [J].
Chapman, JM ;
Cooper, JD ;
Todd, JA ;
Clayton, DG .
HUMAN HEREDITY, 2003, 56 (1-3) :18-31
[7]   Use of unphased multilocus genotype data in indirect association studies [J].
Clayton, D ;
Chapman, J ;
Cooper, J .
GENETIC EPIDEMIOLOGY, 2004, 27 (04) :415-428
[8]   Variations on a theme: Cataloging human DNA sequence variation [J].
Collins, FS ;
Guyer, MS ;
Chakravarti, A .
SCIENCE, 1997, 278 (5343) :1580-1581
[9]  
CRANDALL KA, 1993, GENETICS, V134, P959
[10]   Evidence that triglycerides are an independent coronary heart disease risk factor [J].
Cullen, P .
AMERICAN JOURNAL OF CARDIOLOGY, 2000, 86 (09) :943-949