Robust estimation and testing of haplotype effects in case-control studies

Cited by: 6
Authors
Allen, Andrew S. [1 ,2 ]
Satten, Glen A. [3 ]
Affiliations
[1] Duke Univ, Dept Biostat & Bioinformat, Durham, NC 27710 USA
[2] Duke Univ, Duke Clin Res Inst, Durham, NC 27710 USA
[3] Ctr Dis Control & Prevent, Atlanta, GA USA
Keywords
case-control; haplotypes; nuisance parameter; efficient score; retrospective likelihood; prospective likelihood;
DOI
10.1002/gepi.20259
Chinese Library Classification (CLC) number
Q3 [Genetics];
Subject classification code
071007 [Genetics]; 090102 [Crop Genetics and Breeding];
Abstract
Haplotype-based analyses are thought to play a major role in the study of common complex diseases. This has led to the development of a variety of statistical methods for detecting disease-haplotype associations from case-control study data. However, haplotype phase is often uncertain when only genotype data are available. Methods that account for haplotype ambiguity by modeling the distribution of haplotypes can, if this distribution is misspecified, lead to substantial bias in parameter estimates even when complete genotype data are available. Here we study estimators that can be derived from score functions of appropriate likelihoods. We use the efficient score approach to estimation in the presence of nuisance parameters to derive novel estimators that are robust to the haplotype distribution. We establish key relationships between estimators and study their empirical performance via simulation.
Pages: 29-40
Number of pages: 12
Related papers
23 in total
[1]
Haplotypes vs single marker linkage disequilibrium tests: what do we gain? (Reprinted European Journal of Human Genetics, Vol 4, pg 291-300, 2001) [J].
Akey, Joshua ;
Jin, Li ;
Xiong, Momiao .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 :S51-S58
[2]
Locally-efficient robust estimation of haplotype-disease association in family-based studies [J].
Allen, AS ;
Satten, GA ;
Tsiatis, AA .
BIOMETRIKA, 2005, 92 (03) :559-571
[3]
Bickel Peter J, 1993, Efficient and adaptive estimation for semiparametric models, V4
[4]
Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies [J].
Chatterjee, N ;
Carroll, RJ .
BIOMETRIKA, 2005, 92 (02) :399-418
[5]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]
Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329
[7]
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[8]
Genetic analysis of case/control data using estimated haplotype frequencies: Application to APOE locus variation and Alzheimer's disease [J].
Fallin, D ;
Cohen, A ;
Essioux, L ;
Chumakov, I ;
Blumenfeld, M ;
Cohen, D ;
Schork, NJ .
GENOME RESEARCH, 2001, 11 (01) :143-151
[9]
HAPLO - A PROGRAM USING THE EM ALGORITHM TO ESTIMATE THE FREQUENCIES OF MULTISITE HAPLOTYPES [J].
HAWLEY, ME ;
KIDD, KK .
JOURNAL OF HEREDITY, 1995, 86 (05) :409-411
[10]
Estimation and tests of haplotype-environment interaction when linkage phase is ambiguous [J].
Lake, SL ;
Lyon, H ;
Tantisira, K ;
Silverman, EK ;
Weiss, ST ;
Laird, NM ;
Schaid, DJ .
HUMAN HEREDITY, 2003, 55 (01) :56-65