Disk-covering, a fast-converging method for phylogenetic tree reconstruction

被引:106
作者
Huson, DH
Nettles, SM
Warnow, TJ
机构
[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[2] Celera Genom, Rockville, MD USA
[3] Univ Texas, Dept Elect & Comp Engn, Austin, TX 78712 USA
关键词
algorithms; evolution; phylogenetic trees; clustering; Jukes Cantor; biomolecular data; chordal graphs;
D O I
10.1089/106652799318337
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The evolutionary history of a set of species is represented by a phylogenetic tree, which is a rooted, leaf-labeled tree, where internal nodes represent ancestral species and the leaves represent modern day species, Accurate (or even boundedly inaccurate) topology reconstructions of large and divergent trees from realistic length sequences have long been considered one of the major challenges in systematic biology. In this paper, we present a simple method, the Disk-Covering Method (DCM), which boosts the performance of base phylogenetic methods under various Markov models of evolution, We analyze the performance of DCM-boosted distance methods under the Jukes-Cantor Markov model of biomolecular sequence evolution, and prove that for almost all trees, polylogarithmic length sequences suffice for complete accuracy with high probability, while polynomial length sequences always suffice. We also provide an experimental study based upon simulating sequence evolution on model trees. This study confirms substantial reductions in error rates at realistic sequence lengths.
引用
收藏
页码:369 / 386
页数:18
相关论文
共 46 条
[1]  
Agarwala R, 1996, PROCEEDINGS OF THE SEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P365
[2]  
AMBAINIS R, 1997, P 38 ANN IEEE S FDN, P524
[3]  
ATTESON K, 1997, LECT NOTES COMPUTER, V1273, P101
[4]   A CANONICAL DECOMPOSITION-THEORY FOR METRICS ON A FINITE-SET [J].
BANDELT, HJ ;
DRESS, AWM .
ADVANCES IN MATHEMATICS, 1992, 92 (01) :47-105
[5]  
Buneman P., 1974, Discrete Mathematics, V9, P205, DOI 10.1016/0012-365X(74)90002-8
[6]  
Buneman P., 1971, Mathematics in the Archeological and Historical Sciences, P387
[7]   Evolutionary trees can be learned in polynomial time in the two-state general Markov model [J].
Cryan, M ;
Goldberg, LA ;
Goldberg, PW .
39TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1998, :436-445
[8]  
Csürös M, 1999, PROCEEDINGS OF THE TENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P261
[9]  
DAY W, 1995, J CLASS, V2, P7
[10]  
DOBZHANSKY T, 1993, AM BIOL TEACHER MAR, P125