Quantifying bias due to allele misclassification in case-control studies of haplotypes

被引:11
作者
Govindarajulu, Usha S.
Spiegelman, Donna
Miller, Katie L.
Kraft, Peter
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
[3] Harvard Univ, Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02115 USA
[4] Harvard Univ, Sch Publ Hlth, Program Mol & Genet Epidemiol, Boston, MA 02115 USA
关键词
haplotype; misclassification; odds ratio; case-control; SNP; genotyping error;
D O I
10.1002/gepi.20170
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学]; 090102 [作物遗传育种];
摘要
Objectives Genotyping errors can induce biases in frequency estimates for haplotypes of single nucleotide polymorphisms (SNPs). Here, we considered the impact of SNP allele misclassification on haplotype odds ratio estimates from case-control studies of unrelated individuals. Methods We calculated bias analytically, using the haplotype counts expected in cases and controls under genotype misclassification. We evaluated the bias due to allele misclassification across a range of haplotype distributions using empirical haplotype frequencies within blocks of limited haplotype diversity We also considered simple two- and three-locus haplotype distributions to understand the impact of haplotype frequency and number of SNPs on misclassification bias. Results We found that for common haplotypes (> 5% frequency), realistic genotyping error rates (0.1-1% chance of miscalling an allele), and moderate relative risks (2-4), the bias was always towards the null and increases in magnitude with increasing error rate, increasing odds ratio. For common haplotypes, bias generally increased with increasing haplotype frequency, while for rare haplotypes, bias generally increased with decreasing frequency. When the chance of miscalling an allele is 0.5%, the median bias in haplotype-specific odds ratios for common haplotypes was generally small (< 4% on the log odds ratio scale), but the bias for some individual haplotypes was larger (10-20%). Bias towards the null leads to a loss in power; the relative efficiency using a test statistic based upon misclassified haplotype data compared to a test based on the unobserved true haplotypes ranged from roughly 60% to 80%, and worsened with increasing haplotype frequency. Conclusions The cumulative effect of small allele-calling errors across multiple loci can induce noticeable bias and reduce power in realistic scenarios. This has implications for the design of candidate gene association studies that utilize multimarker haplotypes.
引用
收藏
页码:590 / 601
页数:12
相关论文
共 33 条
[1]
Haplotypes vs single marker linkage disequilibrium tests:: what do we gain? (Reprinted European Journal of Human Genetics, Vol 4, pg 291-300, 2001) [J].
Akey, Joshua ;
Jin, Li ;
Xiong, Momiao .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 :S51-S58
[2]
[Anonymous], ENCY BIOSTATISTICS
[3]
[Anonymous], 1982, EPIDEMIOLOGIC RES
[4]
[Anonymous], 2004, INTER DISC
[5]
Armstrong B.K.W.E., 1994, Principles of Exposure Measurement in Epidemiology
[6]
Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium [J].
Carlson, CS ;
Eberle, MA ;
Rieder, MJ ;
Yi, Q ;
Kruglyak, L ;
Nickerson, DA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2004, 74 (01) :106-120
[7]
The role of haplotypes in candidate gene studies [J].
Clark, AG .
GENETIC EPIDEMIOLOGY, 2004, 27 (04) :321-333
[8]
BIAS DUE TO MISCLASSIFICATION IN ESTIMATION OF RELATIVE RISK [J].
COPELAND, KT ;
CHECKOWAY, H ;
MCMICHAEL, AJ ;
HOLBROOK, RH .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1977, 105 (05) :488-495
[9]
Quantification of the power of Hardy-Weinberg equilibrium testing to detect genotyping error [J].
Cox, DG ;
Kraft, P .
HUMAN HEREDITY, 2006, 61 (01) :10-14
[10]
Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329