StatAlign: an extendable software package for joint Bayesian estimation of alignments and evolutionary trees

Cited by: 62
Authors
Novak, Adam [1]
Miklos, Istvan [1]
Lyngso, Rune [1]
Hein, Jotun [1]
Affiliations
[1] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
Funding
Biotechnology and Biological Sciences Research Council (UK)
DOI
10.1093/bioinformatics/btn457
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Bayesian analysis is one of the most popular methods in phylogenetic inference. The most commonly used methods fix a single multiple alignment and treat only substitutions as phylogenetically informative mutations, although alignments and phylogenies should be inferred jointly because insertions and deletions also carry informative signal. Methods addressing these issues have been developed only recently, and so far there has been no user-friendly program with a graphical interface that implements them.
Results: We have developed an extendable software package in the Java programming language that samples from the joint posterior distribution of phylogenies, alignments and evolutionary parameters using the Markov chain Monte Carlo method. The package also offers tools for efficient on-the-fly summarization of the results. Its graphical interface can be used to configure, start and supervise the analysis, to track the status of the Markov chain and to save the results. The background model for insertions and deletions can be combined with any substitution model, and new substitution models are easy to add to the package as plugins. The samples from the Markov chain can be summarized in several ways, and new postprocessing plugins may also be installed.
Pages: 2403-2404
Number of pages: 2