Simultaneous statistical multiple alignment and phylogeny reconstruction

被引:61
作者
Fleissner, R
Metzler, D
Von Haeseler, A
机构
[1] Univ Idaho, Dept Math, IBEST, Moscow, ID 83844 USA
[2] Goethe Univ Frankfurt, D-60054 Frankfurt, Germany
[3] Univ Dusseldorf, Inst Bioinformat, D-40225 Dusseldorf, Germany
[4] John von Neumann Inst Comp, Forsch Grp Bioinformat, D-52425 Julich, Germany
关键词
multiple sequence alignment; statistical alignment; TKF model; tree reconstruction;
D O I
10.1080/10635150590950371
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Although the reconstruction of phylogenetic trees and the computation of multiple sequence alignments are highly interdependent, these two areas of research lead quite separate lives, the former often making use of stochastic modeling, whereas the latter normally does not. Despite the fact that reasonable insertion and deletion models for sequence pairs were already introduced more than 10 years ago, they have only recently been applied to multiple alignment and only in their simplest version. In this paper we present and discuss a strategy based on simulated annealing, which makes use of these models to infer a phylogenetic tree for a set of DNA or protein sequences together with the sequences' indel history, i.e., their multiple alignment augmented with information about the positioning of insertion and deletion events in the tree. Our method is also the first application of the TKF2 model in the context of multiple sequence alignment. We validate the method via simulations and illustrate it using a data set of primate mtDNA.
引用
收藏
页码:548 / 561
页数:14
相关论文
共 42 条
[31]   A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis (Reprinted from Gene Combis, vol 167, pg GC1-GC10, 1996) [J].
Sonnhammer, ELL ;
Durbin, R .
GENE, 1995, 167 (1-2) :GC1-GC10
[32]   Applying the Thorne-Kishino-Felsenstein model to sequence evolution on a star-shaped tree [J].
Steel, M ;
Hein, J .
APPLIED MATHEMATICS LETTERS, 2001, 14 (06) :679-684
[33]   Quartet puzzling: A quartet maximum-likelihood method for reconstructing tree topologies [J].
Strimmer, K ;
vonHaeseler, A .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (07) :964-969
[34]  
Swofford David L., 1996, P407
[35]   ESTIMATION OF THE NUMBER OF NUCLEOTIDE SUBSTITUTIONS IN THE CONTROL REGION OF MITOCHONDRIAL-DNA IN HUMANS AND CHIMPANZEES [J].
TAMURA, K ;
NEI, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1993, 10 (03) :512-526
[36]  
Tavar? S., 1986, LECT MATH LIFE SCI, V17, P57, DOI DOI 10.1016/J.MARPOLBUL.2009.11.011
[37]   CLUSTAL-W - IMPROVING THE SENSITIVITY OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT THROUGH SEQUENCE WEIGHTING, POSITION-SPECIFIC GAP PENALTIES AND WEIGHT MATRIX CHOICE [J].
THOMPSON, JD ;
HIGGINS, DG ;
GIBSON, TJ .
NUCLEIC ACIDS RESEARCH, 1994, 22 (22) :4673-4680
[38]   INCHING TOWARD REALITY - AN IMPROVED LIKELIHOOD MODEL OF SEQUENCE EVOLUTION [J].
THORNE, JL ;
KISHINO, H ;
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1992, 34 (01) :3-16
[39]   AN EVOLUTIONARY MODEL FOR MAXIMUM-LIKELIHOOD ALIGNMENT OF DNA-SEQUENCES [J].
THORNE, JL ;
KISHINO, H ;
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1991, 33 (02) :114-124
[40]  
THORNE JL, 1992, MOL BIOL EVOL, V9, P1148