Column sorting: Rapid calculation of the phylogenetic likelihood function

被引:20
作者
Pond, SLK
Muse, SV
机构
[1] Univ Arizona, Dept Math, Tucson, AZ 85721 USA
[2] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
[3] N Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
关键词
DNA sequence; graph theory; likelihood; phylogeny; traveling salesman problem;
D O I
10.1080/10635150490522269
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Likelihood applications have become a central approach for molecular evolutionary analyses since the first computationally tractable treatment two decades ago. Although Felsenstein's original pruning algorithm makes likelihood calculations feasible, it is usually possible to take advantage of repetitive structure present in the data to arrive at even greater computational reductions. In particular, alignment columns with certain similarities have components of the likelihood calculation that are identical and need not be recomputed if columns are evaluated in an optimal order. We develop an algorithm for exploiting this speed improvement via an application of graph theory. The reductions provided by the method depend on both the tree and the data, but typical savings range between 15% and 50%. Real-data examples with time reductions of 80% have been identified. The overhead costs associated with implementing the algorithm are minimal, and they are recovered in all but the smallest data sets. The modifications will provide faster likelihood algorithms, which will allow likelihood methods to be applied to larger sets of taxa and to include more thorough searches of the tree topology space.
引用
收藏
页码:685 / 692
页数:8
相关论文
共 12 条
[1]  
CHERITON D, 1972, SIAM J COMPUT, V19, P724
[2]  
Christofides N., 1976, tech. rep.
[3]   A hidden Markov Model approach to variation among sites in rate of evolution [J].
Felsenstein, J ;
Churchill, GA .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (01) :93-104
[4]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[5]  
Felsenstein J., 1993, PHYLIP PHYLOGENY INF
[6]  
Gibbons A., 1985, ALGORITHMIC GRAPH TH
[7]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174
[8]  
Jukes TH, 1969, MAMMALIAN PROTEIN ME, P21, DOI [DOI 10.1016/B978-1-4832-3211-9.50009-7, DOI 10.1093/BIOINFORMATICS/BTM404]
[9]  
Karp R. M., 1972, COMPLEXITY COMPUTER, P85, DOI DOI 10.1007/978-1-4684-2001-2_9