A new statistical method for haplotype reconstruction from population data

被引:6614
作者
Stephens, M
Smith, NJ
Donnelly, P
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[2] Univ Oxford, Dept Stat, Oxford OX1 2JD, England
[3] Univ Oxford, Dept Biochem, Oxford OX1 2JD, England
基金
英国生物技术与生命科学研究理事会; 英国惠康基金; 英国工程与自然科学研究理事会;
关键词
D O I
10.1086/319501
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Current routine genotyping methods typically do not provide haplotype information, which is essential for many analyses of fine-scale molecular-genetics data. Haplotypes can be obtained, at considerable cost, experimentally or (partially) through genotyping of additional family members. Alternatively, a statistical method can be used to infer phase and to reconstruct haplotypes. We present a new statistical method, applicable to genotype data at linked loci from a population sample, that improves substantially on current algorithms; often, error rates are reduced by >50%, relative to its nearest competitor. Furthermore, our algorithm performs well in absolute terms, suggesting that reconstructing haplotypes experimentally or by genotyping additional family members may be an inefficient use of resources.
引用
收藏
页码:978 / 989
页数:12
相关论文
共 25 条
[1]  
CLARK AG, 1990, MOL BIOL EVOL, V7, P111
[2]   PARTITION STRUCTURES, POLYA URNS, THE EWENS SAMPLING FORMULA, AND THE AGES OF ALLELES [J].
DONNELLY, P .
THEORETICAL POPULATION BIOLOGY, 1986, 30 (02) :271-288
[3]   Incorporating genotypes of relatives into a test of linkage disequilibrium [J].
Excoffier, L ;
Slatkin, M .
AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 62 (01) :171-180
[4]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[5]   Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data [J].
Fallin, D ;
Schork, NJ .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) :947-959
[6]  
Gelman A., 1992, STAT SCI, V7, P457, DOI DOI 10.1214/SS/1177011136
[7]  
Harding RM, 1997, AM J HUM GENET, V60, P772
[8]   HAPLO - A PROGRAM USING THE EM ALGORITHM TO ESTIMATE THE FREQUENCIES OF MULTISITE HAPLOTYPES [J].
HAWLEY, ME ;
KIDD, KK .
JOURNAL OF HEREDITY, 1995, 86 (05) :409-411
[9]   Dependency networks for inference, collaborative filtering, and data visualization [J].
Heckerman, D ;
Chickering, DM ;
Meek, C ;
Rounthwaite, R ;
Kadie, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (01) :49-75
[10]   Loss of information due to ambiguous haplotyping of SNPs [J].
Hodge, SE ;
Boehnke, M ;
Spence, MA .
NATURE GENETICS, 1999, 21 (04) :360-361