Error detection in SNP data by considering the likelihood of recombinational history implied by three-site combinations

被引:12
作者
Toleno, Donna M. [1 ]
Morrell, Peter L. [1 ]
Clegg, Michael T. [1 ]
机构
[1] Univ Calif Irvine, Dept Ecol & Evolutionary Biol, Irvine, CA 92697 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btm260
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Errors in nucleotice sequence and SNP genotyping data are problematic when inferring haplotypes. Previously published methods for error detection in haplotype data make use of pedigree information; however, for many samples, individuals are not related by pedigree. This article describes a method for detecting errors in haplotypes by considering the recombinational history implied by the patterns of variation, three SNPs at a time. Results: Coalescent simulations provide evidence that the method is robust to high levels of recombination as well as homologous gene conversion, indicating that patterns produced by both proximate and distant SNPs may be useful for detecting unlikely three-site haplotypes.
引用
收藏
页码:1807 / 1814
页数:8
相关论文
共 37 条
[1]   Merlin-rapid analysis of dense genetic maps using sparse gene flow trees [J].
Abecasis, GR ;
Cherny, SS ;
Cookson, WO ;
Cardon, LR .
NATURE GENETICS, 2002, 30 (01) :97-101
[2]   Identification of probable genotyping errors by consideration of haplotypes [J].
Becker, T ;
Valentonyte, R ;
Croucher, PJP ;
Strauch, K ;
Schreiber, S ;
Hampe, J ;
Knapp, M .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2006, 14 (04) :450-458
[3]   The role of haplotypes in candidate gene studies [J].
Clark, AG .
GENETIC EPIDEMIOLOGY, 2004, 27 (04) :321-333
[4]   Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase [J].
Clark, AG ;
Weiss, KM ;
Nickerson, DA ;
Taylor, SL ;
Buchanan, A ;
Stengård, J ;
Salomaa, V ;
Vartiainen, E ;
Perola, M ;
Boerwinkle, E ;
Sing, CF .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 63 (02) :595-612
[5]   PCR-mediated recombination in amplification products derived from polyploid cotton [J].
Cronn, R ;
Cedroni, M ;
Haselkorn, T ;
Grover, C ;
Wendel, JF .
THEORETICAL AND APPLIED GENETICS, 2002, 104 (2-3) :482-489
[6]   A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data [J].
Douglas, JA ;
Boehnke, M ;
Lange, K .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (04) :1287-1297
[7]   Consed: A graphical tool for sequence finishing [J].
Gordon, D ;
Abajian, C ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :195-202
[8]  
HALLDORSSON BV, 2004, COMPUTATIONAL METHOD
[9]   Generating consistent genotypic configurations for multi-allelic loci and large complex pedigrees [J].
Heath, SC .
HUMAN HEREDITY, 1998, 48 (01) :1-11
[10]   Generating samples under a Wright-Fisher neutral model of genetic variation [J].
Hudson, RR .
BIOINFORMATICS, 2002, 18 (02) :337-338