What SNP genotyping errors are most costly for genetic association studies?

被引:68
作者
Kang, SJ
Gordon, D
Finch, SJ
机构
[1] Rockefeller Univ, Lab Stat Genet, New York, NY 10021 USA
[2] SUNY Stony Brook, Dept Appl Math & Stat, Stony Brook, NY 11794 USA
关键词
genotype error; chi-square; noncentrality parameter; cost; error detection; linkage disequilibrium; test of independence;
D O I
10.1002/gepi.10301
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Which genotype misclassification errors are most costly, in terms of increased sample size necessary (SSN) to maintain constant asymptotic power and significance level, when performing case/control studies of genetic association? We answer this question for single-nucleotide polymorphisms (SNPs), using the 2 x 3 chi(2) test of independence. Our strategy is to expand the noncentrality parameter of the asymptotic distribution of the chi(2) test under a specified alternative hypothesis to approximate SSN, using a linear Taylor series in the error parameters. We consider two scenarios: the first assumes Hardy-Weinberg equilibrium (HWE) for the true genotypes in both cases and controls, and the second assumes HWE only in controls. The Taylor series approximation has a relative error of less than 1% when each error rate is less than 2%. The most costly error is recording the more common homozygote as the less common homozygote, with indefinitely increasing cost coefficient as minor SNP allele frequencies approach 0 in both scenarios. The cost of misclassifying the more common homozygote to the heterozygote also becomes indefinitely large as the minor SNP allele frequency goes to 0 under both scenarios. For the violation of HWE modeled here, the cost of misclassifying a heterozygote to the less common homozygote becomes large, although bounded. Therefore, the use of SNPs with a small minor allele frequency requires careful attention to the frequency of genotyping errors to ensure that power specifications are met. Furthermore, the design of automated genotyping should minimize those errors whose cost coefficients can become indefinitely large.
引用
收藏
页码:132 / 141
页数:10
相关论文
共 34 条