IQPNNI: Moving fast through tree space and stopping in time

被引:102
作者
Vinh, LS
von Haeseler, A [1 ]
机构
[1] Univ Dusseldorf, D-4000 Dusseldorf, Germany
[2] Forsch Zentrum Julich, Julich, Germany
关键词
sequence evolution; maximum likelihood; tree reconstruction; phylogenetics; quartet tree; nearest neighbor interchange; accuracy; rbcL; record value;
D O I
10.1093/molbev/msh176
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An efficient tree reconstruction method (IQPNNI) is introduced to reconstruct a phylogenetic tree based on DNA or amino acid sequence data. Our approach combines various fast algorithms to generate a list of potential candidate trees. The key ingredient is the definition of so-called important quartets (IQs), which allow the computation of an intermediate tree in O(n(2)) time for n sequences. The resulting tree is then further optimized by applying the nearest neighbor interchange (NNI) operation. Subsequently a random fraction of the sequences is deleted from the best tree found so far. The deleted sequences are then re-inserted in the smaller tree using the important quartet puzzling (IQP) algorithm. These steps are repeated several times and the best tree, with respect to the likelihood criterion, is considered as the inferred phylogenetic tree. Moreover, we suggest a rule which indicates when to stop the search. Simulations show that IQPNNI gives a slightly better accuracy than other programs tested. Moreover, we applied the approach to 218 small subunit rRNA sequences and 500 rbcL sequences. We found trees with higher likelihood compared to the results by others. A program to reconstruct DNA or amino acid based phylogenetic trees is available online (http://www.bi.uni-duesseldorf.de/software/iqpnni).
引用
收藏
页码:1565 / 1571
页数:7
相关论文
共 29 条
[1]   Genetic algorithms and parallel processing in maximum-likelihood phylogeny inference [J].
Brauer, MJ ;
Holder, MT ;
Dries, LA ;
Zwickl, DJ ;
Lewis, PO ;
Hillis, DM .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (10) :1717-1726
[2]   Weighted neighbor joining: A likelihood-based approach to distance-based phylogeny reconstruction [J].
Bruno, WJ ;
Socci, ND ;
Halpern, AL .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) :189-197
[3]   Hitch-hiking: A parallel heuristic search strategy, applied to the phylogeny problem [J].
Charleston, MA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (01) :79-91
[4]  
COOKE P, 1980, BIOMETRIKA, V67, P257
[5]  
FELSENSTEIN J, 1993, PHYLIP VERSION 3 5C
[6]  
Felsenstein Joseph, 2004, Inferring_phylogenies, V2
[7]   BIONJ: An improved version of the NJ algorithm based on a simple model of sequence data [J].
Gascuel, O .
MOLECULAR BIOLOGY AND EVOLUTION, 1997, 14 (07) :685-695
[8]   A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood [J].
Guindon, S ;
Gascuel, O .
SYSTEMATIC BIOLOGY, 2003, 52 (05) :696-704
[9]  
Harding E.F., 1971, Adv. Appl. Prob, V3, P44, DOI DOI 10.2307/1426329
[10]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174