Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation

被引:1149
作者
Stephens, M [1 ]
Scheet, P [1 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
关键词
D O I
10.1086/428594
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Although many algorithms exist for estimating haplotypes from genotype data, none of them take full account of both the decay of linkage disequilibrium (LD) with distance and the order and spacing of genotyped markers. Here, we describe an algorithm that does take these factors into account, using a flexible model for the decay of LD with distance that can handle both "blocklike" and "nonblocklike" patterns of LD. We compare the accuracy of this approach with a range of other available algorithms in three ways: for reconstruction of randomly paired, molecularly determined male X chromosome haplotypes; for reconstruction of haplotypes obtained from trios in an autosomal region; and for estimation of missing genotypes in 50 autosomal genes that have been completely resequenced in 24 African Americans and 23 individuals of European descent. For the autosomal data sets, our new approach clearly outperforms the best available methods, whereas its accuracy in inferring the X chromosome haplotypes is only slightly superior. For estimation of missing genotypes, our method performed slightly better when the two subsamples were combined than when they were analyzed separately, which illustrates its robustness to population stratification. Our method is implemented in the software package PHASE (v2.1.1), available from the Stephens Lab Web site.
引用
收藏
页码:449 / 462
页数:14
相关论文
共 33 条
  • [1] [Anonymous], RECOMB ANN INT C RES
  • [2] Besag, 1994, J R STAT SOC B, V56, P591, DOI DOI 10.1111/J.2517-6161.1994.TB02000.X
  • [3] Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans
    Carlson, CS
    Eberle, MA
    Rieder, MJ
    Smith, JD
    Kruglyak, L
    Nickerson, DA
    [J]. NATURE GENETICS, 2003, 33 (04) : 518 - 521
  • [4] Detecting disease associations due to linkage disequilibrium using haplotype tags: A class of tests and the determinants of statistical power
    Chapman, JM
    Cooper, JD
    Todd, JA
    Clayton, DG
    [J]. HUMAN HEREDITY, 2003, 56 (1-3) : 18 - 31
  • [5] CLARK AG, 1990, MOL BIOL EVOL, V7, P111
  • [6] Evidence for substantial fine-scale variation in recombination rates across the human genome
    Crawford, DC
    Bhangale, T
    Li, N
    Hellenthal, G
    Rieder, MJ
    Nickerson, DA
    Stephens, M
    [J]. NATURE GENETICS, 2004, 36 (07) : 700 - 706
  • [7] High-resolution haplotype structure in the human genome
    Daly, MJ
    Rioux, JD
    Schaffner, SE
    Hudson, TJ
    Lander, ES
    [J]. NATURE GENETICS, 2001, 29 (02) : 229 - 232
  • [8] Eskin Eleazar, 2003, J Bioinform Comput Biol, V1, P1, DOI 10.1142/S0219720003000174
  • [9] Evans G., 1993, PRACTICAL NUMERICAL
  • [10] EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921