Maximum likelihood of evolutionary trees: hardness and approximation

被引:51
作者
Chor, B [1 ]
Tuller, T [1 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
基金
以色列科学基金会;
关键词
D O I
10.1093/bioinformatics/bti1027
中图分类号
Q5 [生物化学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Motivation: Maximum likelihood (ML) is an increasingly popular optimality criterion for selecting evolutionary trees. Yet the computational complexity of ML was open for over 20 years, and only recently resolved by the authors for the Jukes-Cantor model of substitution and its generalizations. It was proved that reconstructing the ML tree is computationally intractable (NP-hard). In this work we explore three directions, which extend that result. Results: (1) We show that ML under the assumption of molecular clock is still computationally intractable (NP-hard). (2) We show that not only is it computationally intractable to find the exact ML tree, even approximating the logarithm of the ML for any multiplicative factor smaller than 1.00175 is computationally intractable. (3) We develop an algorithm for approximating log-likelihood under the condition that the input sequences are sparse. It employs any approximation algorithm for parsimony, and asymptotically achieves the same approximation ratio. We note that ML reconstruction for sparse inputs is still hard under this condition, and furthermore many real datasets satisfy it.
引用
收藏
页码:I97 / I106
页数:10
相关论文
共 43 条
[1]
Addario-Berry Louigi, 2004, J Bioinform Comput Biol, V2, P257, DOI 10.1142/S0219720004000557
[2]
Evolution of α2-fucosyltransferase genes in primates:: Relation between an intronic Alu-Y element and red cell expression of ABH antigens [J].
Apoil, PA ;
Roubinet, F ;
Despiau, S ;
Mollicone, R ;
Oriol, R ;
Blancher, A .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (03) :337-351
[3]
AUSIELLO G, 1998, COMPLEXITY APPROXIMA
[4]
Berman P., 1999, Automata, Languages and Programming. 26th International Colloquium, ICALP'99. Proceedings (Lecture Notes in Computer Science Vol.1644), P200
[5]
Better methods for solving parsimony and compatibility [J].
Bonet, M ;
Steel, M ;
Warnow, T ;
Yooseph, S .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (03) :391-407
[6]
Full reconstruction of Markov models on evolutionary trees: Identifiability and consistency [J].
Chang, JT .
MATHEMATICAL BIOSCIENCES, 1996, 137 (01) :51-73
[7]
Multiple maxima of likelihood in phylogenetic trees: An analytic approach [J].
Chor, B ;
Hendy, MD ;
Holland, BR ;
Penny, D .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (10) :1529-1541
[8]
CHOR B, 2005, P 9 ANN INT C RES CO
[9]
Provably fast and accurate recovery of evolutionary trees through harmonic greedy triplets [J].
Csurös, M ;
Kao, MY .
SIAM JOURNAL ON COMPUTING, 2001, 31 (01) :306-322