Haplotype inference in random population samples

被引:139
作者
Lin, S [1 ]
Cutler, DJ [1 ]
Zwick, ME [1 ]
Chakravarti, A [1 ]
机构
[1] Johns Hopkins Univ, Sch Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21287 USA
关键词
D O I
10.1086/344347
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Contemporary genotyping and sequencing methods do not provide information on linkage phase in diploid organisms. The application of statistical methods to infer and reconstruct linkage phase in samples of diploid sequences is a potentially time- and labor-saving method. The Stephens-Smith-Donnelly (SSD) algorithm is one such method, which incorporates concepts from population genetics theory in a Markov chain-Monte Carlo technique. We applied a modified SSD method, as well as the expectation-maximization and partition-ligation algorithms, to sequence data from eight loci spanning >1 Mb on the human X chromosome. We demonstrate that the accuracy of the modified SSD method is better than that of the other algorithms and is superior in terms of the number of sites that may be processed. Also, we find phase reconstructions by the modified SSD method to be highly accurate over regions with high linkage disequilibrium (LD). If only polymorphisms with a minor allele frequency >0.2 are analyzed and scored according to the fraction of neighbor relations correctly called, reconstructions are 95.2% accurate over entire 100-kb stretches and are 98.6% accurate within blocks of high LD.
引用
收藏
页码:1129 / 1137
页数:9
相关论文
共 24 条
  • [1] [Anonymous], 1981, Statistical Tables
  • [2] CLARK AG, 1990, MOL BIOL EVOL, V7, P111
  • [3] A DNA polymorphism discovery resource for research on human genetic variation
    Collins, FS
    Brooks, LD
    Chakravarti, A
    [J]. GENOME RESEARCH, 1998, 8 (12) : 1229 - 1231
  • [4] High-throughput variation detection and genotyping using microarrays
    Cutler, DJ
    Zwick, ME
    Carrasquillo, MM
    Yohn, CT
    Tobin, KP
    Kashuk, C
    Mathews, DJ
    Shah, NA
    Eichler, EE
    Warrington, JA
    Chakravarti, A
    [J]. GENOME RESEARCH, 2001, 11 (11) : 1913 - 1925
  • [5] High-resolution haplotype structure in the human genome
    Daly, MJ
    Rioux, JD
    Schaffner, SE
    Hudson, TJ
    Lander, ES
    [J]. NATURE GENETICS, 2001, 29 (02) : 229 - 232
  • [6] Experimentally-derived haplotypes substantially increase the efficiency of linkage disequilibrium studies
    Douglas, JA
    Boehnke, M
    Gillanders, E
    Trent, JA
    Gruber, SB
    [J]. NATURE GENETICS, 2001, 28 (04) : 361 - 364
  • [7] EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
  • [8] Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data
    Fallin, D
    Schork, NJ
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) : 947 - 959
  • [9] Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease
    Hugot, JP
    Chamaillard, M
    Zouali, H
    Lesage, S
    Cézard, JP
    Belaiche, J
    Almer, S
    Tysk, C
    O'Morain, CA
    Gassull, M
    Binder, V
    Finkel, Y
    Cortot, A
    Modigliani, R
    Laurent-Puig, P
    Gower-Rousseau, C
    Macry, J
    Colombel, JF
    Sahbatou, M
    Thomas, G
    [J]. NATURE, 2001, 411 (6837) : 599 - 603
  • [10] Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex
    Jeffreys, AJ
    Kauppi, L
    Neumann, R
    [J]. NATURE GENETICS, 2001, 29 (02) : 217 - 222