Estimation of genotype error rate using samples with pedigree information - an application on the GeneChip Mapping 10K array

被引:39
作者
Hao, K
Li, C
Rosenow, C
Wong, WH
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Dept Biostat, Boston, MA USA
[3] Genom Collaborat, Santa Clara, CA USA
[4] Harvard Univ, Dept Stat, Sch Med, Cambridge, MA USA
关键词
genotyping error; Mendelian inheritance; GeneChip Mapping 10K array; single nucleotide polymorphism;
D O I
10.1016/j.ygeno.2004.05.003
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Currently, most analytical methods assume all observed genotypes are correct; however, it is clear that errors may reduce statistical power or bias inference in genetic studies. We propose procedures for estimating error rate in genetic analysis and apply them to study the GeneChip Mapping 10K array, which is a technology that has recently become available and allows researchers to survey over 10,000 SNPs in a single assay. We employed a strategy to estimate the genotype error rate in pedigree data. First, the "dose -response" reference curve between error rate and the observable error number were derived by simulation, conditional on given pedigree structures and genotypes. Second, the error rate was estimated by calibrating the number of observed errors in real data to the reference curve. We evaluated the performance of this method by simulation study and applied it to a data set of 30 pedigrees genotyped using the GeneChip Mapping 10K array. This method performed favorably in all scenarios we surveyed. The dose-response reference curve was monotone and almost linear with a large slope. The method was able to estimate accurately the error rate under various pedigree structures and error models and under heterogeneous error rates. Using this method, we found that the average genotyping error rate of the GeneChip Mapping I OK array was about 0.1%. Our method provides a quick and unbiased solution to address the genotype error rate in pedigree data. It behaves well in a wide range of settings and can be easily applied in other genetic projects. The robust estimation of genotyping error rate allows us to estimate power and sample size and conduct unbiased genetic tests. The GeneChip Mapping I OK array has a low overall error rate, which is consistent with the results obtained from alternative genotyping assays. (C) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:623 / 630
页数:8
相关论文
共 23 条
[1]   Merlin-rapid analysis of dense genetic maps using sparse gene flow trees [J].
Abecasis, GR ;
Cherny, SS ;
Cookson, WO ;
Cardon, LR .
NATURE GENETICS, 2002, 30 (01) :97-101
[2]   The impact of genotyping error on family-based analysis of quantitative traits [J].
Abecasis, GR ;
Cherny, SS ;
Cardon, LR .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2001, 9 (02) :130-134
[3]   The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures [J].
Akey, JM ;
Zhang, K ;
Xiong, MM ;
Doris, P ;
Jin, L .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (06) :1447-1456
[4]  
[Anonymous], BIOTECHNIQUES S
[5]  
BUETOW KH, 1991, AM J HUM GENET, V49, P985
[6]   Probability of detection of genotyping errors and mutations as inheritance inconsistencies in nuclear-family data [J].
Douglas, JA ;
Skol, AD ;
Boehnke, M .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (02) :487-495
[7]  
Ehm MG, 1996, AM J HUM GENET, V58, P225
[8]   Detection rates for genotyping errors in SNPs using the trio design [J].
Geller, F ;
Ziegler, A .
HUMAN HEREDITY, 2002, 54 (03) :111-117
[9]   The effects of genotyping errors and interference on estimation of genetic distance [J].
Goldstein, DR ;
Zhao, HY ;
Speed, TP .
HUMAN HEREDITY, 1997, 47 (02) :86-100
[10]  
Gordon D, 2001, Pac Symp Biocomput, P18