A Two-Stage Pruning Algorithm for Likelihood Computation for a Population Tree

被引:35
作者
RoyChoudhury, Arindam [1 ]
Felsenstein, Joseph [2 ]
Thompson, Elizabeth A. [3 ]
机构
[1] Harvard Univ, Biol Labs 4092 4100, Wakeley Lab, Dept Organism & Evolutionary Biol, Cambridge, MA 02138 USA
[2] Univ Washington, Dept Genome, Seattle, WA 98195 USA
[3] Univ Washington, Dept Stat, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1534/genetics.107.085753
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We have developed a pruning algorithm for likelihood estimation of a tree of populations. This algorithm enables us to Compute the likelihood for large trees. Thus, it gives an efficient way of obtaining the maximum-likelihood estimate (MLE) for a given tree topology. Our method utilizes the differences accumulated by random genetic drift in allele count data from single-nucleotide polymorphisms (SNPs), ignoring the effect of mutation after divergence from the common ancestral population. The computation of the maximum-likelihood tree involves both maximizing likelihood over branch lengths of a given topology and comparing the maximum-likelihood across topologies. Here our focus is the maximization of likelihood over branch lengths of a given topology. The pruning algorithm computes arrays of probabilities at the root of the tree from the data at the tips of the tree; at the root, the arrays determine the likelihood. The arrays consist of probabilities related to the number of coalescences and allele counts for the partially coalesced lineages. Computing these probabilities requires an unusual two-stage algorithm. Our computation is exact and avoids time-consuming Monte Carlo methods. We can also correct for ascertainment bias.
引用
收藏
页码:1095 / 1105
页数:11
相关论文
共 20 条
[11]   UNLIKELIHOOD THAT MINIMAL PHYLOGENIES FOR A REALISTIC BIOLOGICAL STUDY CAN BE CONSTRUCTED IN REASONABLE COMPUTATIONAL TIME [J].
GRAHAM, RL ;
FOULDS, LR .
MATHEMATICAL BIOSCIENCES, 1982, 60 (02) :133-142
[12]  
HEUCH I, 1972, CLIN GENET, V3, P501, DOI 10.1111/j.1399-0004.1972.tb01488.x
[13]  
HILDEN J, 1970, Clinical Genetics, V1, P319
[14]  
Moran P.A.P., 1962, STAT PROCESSES EVOLU
[15]  
Nielsen R, 1998, EVOLUTION, V52, P669, DOI [10.1111/j.1558-5646.1998.tb03692.x, 10.2307/2411262]
[16]   Likelihood analysis of ongoing gene flow and historical association [J].
Nielsen, R ;
Slatkin, M .
EVOLUTION, 2000, 54 (01) :44-50
[17]  
TAKAHATA N, 1985, GENETICS, V110, P325
[18]  
Thompson E.A., 1975, Human evolutionary trees
[19]   The SNP Consortium website: past, present and future [J].
Thorisson, GA ;
Stein, LD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :124-127
[20]  
Wright S, 1931, GENETICS, V16, P0097