Accuracy of neighbor joining for n-taxon trees

被引:18
作者
Strimmer, K
vonHaeseler, A
机构
关键词
assigning edge lengths; Felsenstein zone; finite sequence length; Jukes-Cantor model; Monte Carlo sampling; neighbor joining;
D O I
10.2307/2413528
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A Monte Carlo approach was used to estimate the accuracy of a given tree reconstruction method for any number of taxa. In this procedure, we sampled randomly over all possible bifurcating trees assigning substitution rates (branch lengths) to each edge from an exponential distribution to obtain a biologically sensible maximal observed distance. Three different sets of trees were studied: the unrestricted tree space, the biologically meaningful tree space as introduced by Nei et al. (1995, Science 267:253-254), and the population data tree space. We used this technique to elucidate the performance of neighbor joining asci-function of the number of taxa, assuming that distances are uncorrected and sequences evolve according to the Jukes-Cantor model. The accuracy of neighbor joining decreases almost exponentially with the number of taxa. However, the rate of decrease depends on the tree space studied. Although the accuracy decreases towards zero, the similarity, i.e., the number of partitions that are identical between model tree and reconstructed tree, is in all cases studied much higher than the value expected for two randomly chosen trees. Although the probability of recovering the true tree is dramatically influenced by sequence length, the average similarity does not decrease substantially if branch lengths are not too short.
引用
收藏
页码:516 / 523
页数:8
相关论文
共 21 条
[11]   THE NEIGHBOR-JOINING METHOD - A NEW METHOD FOR RECONSTRUCTING PHYLOGENETIC TREES [J].
SAITOU, N ;
NEI, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1987, 4 (04) :406-425
[12]   Performance of the maximum likelihood, neighbor joining, and maximum parsimony methods when sequence sites are not independent [J].
Schoniger, M ;
vonHaeseler, A .
SYSTEMATIC BIOLOGY, 1995, 44 (04) :533-547
[13]  
SCHONIGER M, 1993, MOL BIOL EVOL, V10, P471
[14]  
SOURDIS J, 1987, MOL BIOL EVOL, V4, P159
[15]  
SOURDIS J, 1988, MOL BIOL EVOL, V5, P298
[16]   DISTRIBUTIONS OF TREE COMPARISON METRICS - SOME NEW RESULTS [J].
STEEL, MA ;
PENNY, D .
SYSTEMATIC BIOLOGY, 1993, 42 (02) :126-141
[17]   STATISTICAL PROPERTIES OF MOLECULAR TREE CONSTRUCTION METHODS UNDER THE NEUTRAL MUTATION MODEL [J].
TATENO, Y ;
TAJIMA, F .
JOURNAL OF MOLECULAR EVOLUTION, 1986, 23 (04) :354-361
[18]   ACCURACY OF ESTIMATED PHYLOGENETIC TREES FROM MOLECULAR-DATA .1. DISTANTLY RELATED SPECIES [J].
TATENO, Y ;
NEI, M ;
TAJIMA, F .
JOURNAL OF MOLECULAR EVOLUTION, 1982, 18 (06) :387-404
[19]  
TATENO Y, 1994, MOL BIOL EVOL, V11, P261
[20]   AFRICAN POPULATIONS AND THE EVOLUTION OF HUMAN MITOCHONDRIAL-DNA [J].
VIGILANT, L ;
STONEKING, M ;
HARPENDING, H ;
HAWKES, K ;
WILSON, AC .
SCIENCE, 1991, 253 (5027) :1503-1507