Fast recovery of evolutionary trees with thousands of nodes

被引:21
作者
Csúrös, M [1 ]
机构
[1] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
关键词
phylogeny; evolutionary tree reconstruction; distance-based method; harmonic greedy triplets;
D O I
10.1089/10665270252935467
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel distance-based algorithm for evolutionary tree reconstruction. Our algorithm reconstructs the topology of a tree with n leaves in O(n 2) time using O(n) working space. In the general Markov model of evolution, the algorithm recovers the topology successfully with (1 - o(l)) probability from sequences with polynomial length in n. Moreover, for almost all trees, our algorithm achieves the same success probability on polylogarithmic sample sizes. The theoretical results are supported by simulation experiments involving trees with 500, 1,895, and 3,135 leaves. The topologies of the trees are recovered with high success from 2,000 bp DNA sequences.
引用
收藏
页码:277 / 297
页数:21
相关论文
共 37 条
  • [1] [Anonymous], 1971, STAT DECISION THEORY
  • [2] [Anonymous], THESIS YALE U
  • [3] The performance of neighbor-joining methods of phylogenetic reconstruction
    Atteson, K
    [J]. ALGORITHMICA, 1999, 25 (2-3) : 251 - 278
  • [4] RECONSTRUCTING THE SHAPE OF A TREE FROM OBSERVED DISSIMILARITY DATA
    BANDELT, HJ
    DRESS, A
    [J]. ADVANCES IN APPLIED MATHEMATICS, 1986, 7 (03) : 309 - 343
  • [5] Phylogeny - Deep Green rewrites evolutionary history of plants
    Brown, KS
    [J]. SCIENCE, 1999, 285 (5430) : 990 - 991
  • [6] Weighted neighbor joining: A likelihood-based approach to distance-based phylogeny reconstruction
    Bruno, WJ
    Socci, ND
    Halpern, AL
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) : 189 - 197
  • [7] BRZUSTOWSKI J, 1998, QCLUST V0 2
  • [8] Buneman P., 1971, Mathematics in the Archaeological and Historical Sciences, P387
  • [9] PHYLOGENETICS OF SEED PLANTS - AN ANALYSIS OF NUCLEOTIDE-SEQUENCES FROM THE PLASTID GENE RBCL
    CHASE, MW
    SOLTIS, DE
    OLMSTEAD, RG
    MORGAN, D
    LES, DH
    MISHLER, BD
    DUVALL, MR
    PRICE, RA
    HILLS, HG
    QIU, YL
    KRON, KA
    RETTIG, JH
    CONTI, E
    PALMER, JD
    MANHART, JR
    SYTSMA, KJ
    MICHAELS, HJ
    KRESS, WJ
    KAROL, KG
    CLARK, WD
    HEDREN, M
    GAUT, BS
    JANSEN, RK
    KIM, KJ
    WIMPEE, CF
    SMITH, JF
    FURNIER, GR
    STRAUSS, SH
    XIANG, QY
    PLUNKETT, GM
    SOLTIS, PS
    SWENSEN, SM
    WILLIAMS, SE
    GADEK, PA
    QUINN, CJ
    EGUIARTE, LE
    GOLENBERG, E
    LEARN, GH
    GRAHAM, SW
    BARRETT, SCH
    DAYANANDAN, S
    ALBERT, VA
    [J]. ANNALS OF THE MISSOURI BOTANICAL GARDEN, 1993, 80 (03) : 528 - 580
  • [10] Evolutionary trees can be learned in polynomial time in the two-state general Markov model
    Cryan, M
    Goldberg, LA
    Goldberg, PW
    [J]. 39TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1998, : 436 - 445