Accuracy of neighbor joining for n-taxon trees

被引:18
作者
Strimmer, K
vonHaeseler, A
机构
关键词
assigning edge lengths; Felsenstein zone; finite sequence length; Jukes-Cantor model; Monte Carlo sampling; neighbor joining;
D O I
10.2307/2413528
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A Monte Carlo approach was used to estimate the accuracy of a given tree reconstruction method for any number of taxa. In this procedure, we sampled randomly over all possible bifurcating trees assigning substitution rates (branch lengths) to each edge from an exponential distribution to obtain a biologically sensible maximal observed distance. Three different sets of trees were studied: the unrestricted tree space, the biologically meaningful tree space as introduced by Nei et al. (1995, Science 267:253-254), and the population data tree space. We used this technique to elucidate the performance of neighbor joining asci-function of the number of taxa, assuming that distances are uncorrected and sequences evolve according to the Jukes-Cantor model. The accuracy of neighbor joining decreases almost exponentially with the number of taxa. However, the rate of decrease depends on the tree space studied. Although the accuracy decreases towards zero, the similarity, i.e., the number of partitions that are identical between model tree and reconstructed tree, is in all cases studied much higher than the value expected for two randomly chosen trees. Although the probability of recovering the true tree is dramatically influenced by sequence length, the average similarity does not decrease substantially if branch lengths are not too short.
引用
收藏
页码:516 / 523
页数:8
相关论文
共 21 条
[1]  
HEDGES SB, 1992, SCIENCE, V255, P737, DOI 10.1126/science.1738849
[2]  
Hendy M. D., 1988, Classification and Related Methods of Data Analysis. Proceedings of the First Conference of the International Federation of Classification Societies (IFCS), P355
[3]   APPLICATION AND ACCURACY OF MOLECULAR PHYLOGENIES [J].
HILLIS, DM ;
HUELSENBECK, JP ;
CUNNINGHAM, CW .
SCIENCE, 1994, 264 (5159) :671-677
[4]   SUCCESS OF PHYLOGENETIC METHODS IN THE 4-TAXON CASE [J].
HUELSENBECK, JP ;
HILLIS, DM .
SYSTEMATIC BIOLOGY, 1993, 42 (03) :247-264
[5]   PERFORMANCE OF PHYLOGENETIC METHODS IN SIMULATION [J].
HUELSENBECK, JP .
SYSTEMATIC BIOLOGY, 1995, 44 (01) :17-48
[6]  
JIN L, 1990, MOL BIOL EVOL, V7, P82
[7]  
JUKES T H, 1969, P21
[8]  
KUHNER MK, 1994, MOL BIOL EVOL, V11, P459
[9]   ASSESSING MOLECULAR PHYLOGENIES [J].
NEI, M ;
TAKEZAKI, N ;
SITNIKOVA, T .
SCIENCE, 1995, 267 (5195) :253-255
[10]  
SAITOU N, 1989, MOL BIOL EVOL, V6, P514