Identification of probable genotyping errors by consideration of haplotypes

Times cited: 14
Authors
Becker, T
Valentonyte, R
Croucher, PJP
Strauch, K
Schreiber, S
Hampe, J
Knapp, M
Affiliations
[1] Univ Bonn, Inst Med Biometry Informat & Epidemiol, D-53105 Bonn, Germany
[2] Univ Kiel, Inst Clin Mol Biol, Kiel, Germany
[3] Univ Hosp Schleswig Holstein, Kiel, Germany
[4] Univ Kiel, Inst Med Informat & Stat, Kiel, Germany
[5] Univ Marburg, Inst Med Biometry & Epidemiol, Marburg, Germany
Keywords
genotype error; haplotype; frequency estimation
DOI
10.1038/sj.ejhg.5201565
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Undetected genotyping errors pose a problem in genetic epidemiological studies, as they may invalidate statistical analysis or reduce its power. Haplotype analysis requires an improved standard of the data, because a haplotype can be inferred correctly only if the genotypes of all its markers are correct. Here, we present a method that identifies probable genotyping errors in trio samples with the help of the estimated haplotype frequency distribution of the sample. If the likelihood of the most likely haplotype explanation depends strongly on just one genotype, in the sense that setting the genotype to be missing leads to a much more likely haplotype explanation, this genotype is considered a potential genotyping error. We describe a method that systematically searches the whole data set for such potential errors. Based on the haplotype distribution of a real data set, we carry out a simulation study to estimate the sensitivity and specificity of the method. In addition, we apply our approach to the real data set itself. Potentially erroneous genotypes are re-determined via sequencing. The results of both the simulation study and the application to the real data set show that a considerable proportion of true genotyping errors is detected and that the number of false-positive signals is acceptable. We conclude that it is indeed possible to identify probable genotyping errors by considering haplotypes. The method described here will be part of the next release of our FAMHAP software.
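The masking criterion in the abstract can be illustrated with a minimal Python sketch. This is not the FAMHAP implementation. It makes several simplifying assumptions that are not in the abstract: single unrelated individuals instead of trios, biallelic markers coded as allele counts, haplotype frequencies supplied directly rather than estimated from the sample, and a hypothetical likelihood-ratio cutoff `ratio_threshold`. The function names are likewise illustrative.

```python
from itertools import combinations_with_replacement


def best_explanation(genotype, hap_freqs):
    """Likelihood of the most likely haplotype pair compatible with a genotype.

    genotype: tuple of allele counts per marker (0, 1 or 2), None = missing.
    hap_freqs: dict mapping haplotypes (tuples of 0/1 alleles) to frequencies.
    """
    best = 0.0
    for h1, h2 in combinations_with_replacement(list(hap_freqs), 2):
        compatible = all(
            g is None or a + b == g for g, a, b in zip(genotype, h1, h2)
        )
        if compatible:
            # An unordered pair of distinct haplotypes can arise in two ways.
            weight = 1.0 if h1 == h2 else 2.0
            best = max(best, weight * hap_freqs[h1] * hap_freqs[h2])
    return best


def flag_suspect_genotypes(genotype, hap_freqs, ratio_threshold=10.0):
    """Return marker indices whose masking makes the best explanation
    more than ratio_threshold times as likely (hypothetical cutoff)."""
    base = best_explanation(genotype, hap_freqs)
    suspects = []
    for i, g in enumerate(genotype):
        if g is None:
            continue  # already missing, nothing to test
        masked = genotype[:i] + (None,) + genotype[i + 1:]
        alt = best_explanation(masked, hap_freqs)
        if base == 0.0:
            # No explanation at all unless this genotype is set to missing.
            if alt > 0.0:
                suspects.append(i)
        elif alt / base > ratio_threshold:
            suspects.append(i)
    return suspects


# Toy haplotype distribution over three markers (invented for illustration).
toy_freqs = {(0, 0, 0): 0.6, (1, 1, 1): 0.39, (0, 1, 0): 0.01}

# The middle genotype forces the rare haplotype (0, 1, 0) into the explanation.
suspects = flag_suspect_genotypes((0, 1, 0), toy_freqs)
clean = flag_suspect_genotypes((0, 0, 0), toy_freqs)
```

In this toy example only the middle marker is flagged: masking it lets the genotype be explained by two copies of the common haplotype, whereas masking either of the other markers leaves the best explanation unchanged.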
Pages: 450-458
Number of pages: 9