A Two-Stage Pruning Algorithm for Likelihood Computation for a Population Tree

被引:35
作者
RoyChoudhury, Arindam [1 ]
Felsenstein, Joseph [2 ]
Thompson, Elizabeth A. [3 ]
机构
[1] Harvard Univ, Biol Labs 4092 4100, Wakeley Lab, Dept Organism & Evolutionary Biol, Cambridge, MA 02138 USA
[2] Univ Washington, Dept Genome, Seattle, WA 98195 USA
[3] Univ Washington, Dept Stat, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1534/genetics.107.085753
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We have developed a pruning algorithm for likelihood estimation of a tree of populations. This algorithm enables us to Compute the likelihood for large trees. Thus, it gives an efficient way of obtaining the maximum-likelihood estimate (MLE) for a given tree topology. Our method utilizes the differences accumulated by random genetic drift in allele count data from single-nucleotide polymorphisms (SNPs), ignoring the effect of mutation after divergence from the common ancestral population. The computation of the maximum-likelihood tree involves both maximizing likelihood over branch lengths of a given topology and comparing the maximum-likelihood across topologies. Here our focus is the maximization of likelihood over branch lengths of a given topology. The pruning algorithm computes arrays of probabilities at the root of the tree from the data at the tips of the tree; at the root, the arrays determine the likelihood. The arrays consist of probabilities related to the number of coalescences and allele counts for the partially coalesced lineages. Computing these probabilities requires an unusual two-stage algorithm. Our computation is exact and avoids time-consuming Monte Carlo methods. We can also correct for ascertainment bias.
引用
收藏
页码:1095 / 1105
页数:11
相关论文
共 20 条
[1]   PHYLOGENETIC ANALYSIS - MODELS AND ESTIMATION PROCEDURES [J].
CAVALLISFORZA, LL ;
EDWARDS, AWF .
EVOLUTION, 1967, 21 (03) :550-+
[2]   Ascertainment bias in studies of human genome-wide polymorphism [J].
Clark, AG ;
Hubisz, MJ ;
Bustamante, CD ;
Williamson, SH ;
Nielsen, R .
GENOME RESEARCH, 2005, 15 (11) :1496-1502
[3]  
EDWARDS A. W. F., 1964, SYST ASS PUBLICATION, V6, P67
[4]   GENERAL MODEL FOR GENETIC ANALYSIS OF PEDIGREE DATA [J].
ELSTON, RC ;
STEWART, J .
HUMAN HEREDITY, 1971, 21 (06) :523-&
[5]  
FELSENSTEIN J, 1973, AM J HUM GENET, V25, P471
[6]   MAXIMUM LIKELIHOOD AND MINIMUM-STEPS METHODS FOR ESTIMATING EVOLUTIONARY TREES FROM DATA ON DISCRETE CHARACTERS [J].
FELSENSTEIN, J .
SYSTEMATIC ZOOLOGY, 1973, 22 (03) :240-249
[7]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[8]  
FELSENSTEIN J, 1968, THESIS U CHICAGO CHI
[9]  
Felsenstein Joseph, 2004, Inferring_phylogenies, V2
[10]  
Foulds L.R., 1982, ADV APPL MATH, V3, P43, DOI DOI 10.1016/S0196-8858(82)80004-3