Evolutionary HMMs: a Bayesian approach to multiple alignment

被引:103
作者
Holmes, I [1 ]
Bruno, WJ [1 ]
机构
[1] Los Alamos Natl Lab, Grp T10, Los Alamos, NM 87545 USA
关键词
D O I
10.1093/bioinformatics/17.9.803
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: We review proposed syntheses of probabilistic sequence alignment, profiling and phylogeny. We develop a multiple alignment algorithm for Bayesian inference in the links model proposed by Thorne et el. (1991, J. Mol, Evol., 33, 114-124). The algorithm, described in detail in Section 3, samples from and/or maximizes the posterior distribution over multiple alignments for any number of DNA or protein sequences, conditioned on a phylogenetic tree. The individual sampling and maximization steps of the algorithm require no more computational resources than pairwise alignment. Methods: We present a software implementation (Handel) of our algorithm and report test results on (I) simulated data sets and (ii) the structurally informed protein alignments of BALiBASE Results: We find that the mean sum-of-pairs score (a measure of residue-pair correspondence) for the BAliBASE alignments is only 13% lower for Handel than for CLUSTALW (Thompson et al., 1994, Nucleic Acids Res., 22, 4673-4680), despite the relative simplicity of the links model (CLUSTALW uses affine gap scores and increased penalties for indels in hydrophobic regions). With reference to these benchmarks, we discuss potential improvements to the links model and implications for Bayesian multiple alignment and phylogenetic profiling.
引用
收藏
页码:803 / 820
页数:18
相关论文
共 49 条