Prospects for inferring very large phylogenies by using the neighbor-joining method

被引:4085
作者
Tamura, K
Nei, M
Kumar, S [1 ]
机构
[1] Arizona State Univ, Ctr Evolutionary Funct Genomics, Biodesign Inst, Tempe, AZ 85287 USA
[2] Arizona State Univ, Sch Life Sci, Tempe, AZ 85287 USA
[3] Tokyo Metropolitan Univ, Dept Biol Sci, Tokyo 1920397, Japan
[4] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
关键词
phylogenetics; molecular evolution; distance estimation; tree of life; maximum likelihood;
D O I
10.1073/pnas.0404206101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Current efforts to reconstruct the tree of life and histories of multigene families demand the inference of phylogenies consisting of thousands of gene sequences. However, for such large data sets even a moderate exploration of the tree space needed to identify the optimal tree is virtually impossible. For these cases the neighbor-joining (NJ) method is frequently used because of its demonstrated accuracy for smaller data sets and its computational speed. As data sets grow, however, the fraction of the tree space examined by the NJ algorithm becomes minuscule. Here, we report the results of our computer simulation for examining the accuracy of NJ trees for inferring very large phylogenies. First we present a likelihood method for the simultaneous estimation of all pairwise distances by using biologically realistic models of nucleotide substitution. Use of this method corrects up to 60% of NJ tree errors. Our simulation results show that the accuracy of NJ trees decline only by approximate to5% when the number of sequences used increases from 32 to 4,096 (128 times) even in the presence of extensive variation in the evolutionary rate among lineages or significant biases in the nucleotide composition and transition/transversion ratio. Our results encourage the use of complex models of nucleotide substitution for estimating evolutionary distances and hint at bright prospects for the application of the NJ and related methods in inferring large phylogenies.
引用
收藏
页码:11030 / 11035
页数:6
相关论文
共 32 条