A Markov chain Monte Carlo expectation maximization algorithm for statistical analysis of DNA sequence evolution with neighbor-dependent substitution rates

被引:18
作者
Hobolth, Asger [1 ]
机构
[1] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
关键词
EM-algorithm; Gibbs sampling; likelihood inference; molecular evolution; neighbor-dependence; path sampling;
D O I
10.1198/106186008X289010
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The evolution of DNA sequences can be described by discrete state continuous time Markov processes on a phylogenetic tree. We consider neighbor-dependent evolutionary models where the instantaneous rate of substitution at a site depends on the states of the neighboring sites. Neighbor-dependent substitution models are analytically intractable and must be analyzed using either approximate or simulation-based methods. We describe statistical inference of neighbor-dependent models using a Markov chain Monte Carlo expectation maximization (MCMC-EM) algorithm. In the MCMC-EM algorithm, the high-dimensional integrals required in the EM algorithm are estimated using MCMC sampling. The MCMC sampler requires simulation of sample paths from a continuous time Markov process, conditional on the beginning and ending states and the paths of the neighboring sites. An exact path sampling algorithm is developed for this purpose.
引用
收藏
页码:138 / 162
页数:25
相关论文
共 36 条
[21]  
JENSEN JL, 2005, 458 AARH U DEP MATH
[23]  
Lauritzen S. L., 1996, GRAPH MODELS
[24]  
LOUIS TA, 1982, J ROY STAT SOC B MET, V44, P226
[25]   Bayesian coestimation of phylogeny and sequence alignment -: art. no. 83 [J].
Lunter, G ;
Miklós, I ;
Drummond, A ;
Jensen, JL ;
Hein, J .
BMC BIOINFORMATICS, 2005, 6 (1)
[26]   A nucleotide substitution model with nearest-neighbour interactions [J].
Lunter, Gerton ;
Hein, Jotun .
BIOINFORMATICS, 2004, 20 :216-223
[27]   Mapping mutations on phylogenies [J].
Nielsen, R .
SYSTEMATIC BIOLOGY, 2002, 51 (05) :729-739
[28]   A NOTE ON THE DELTA METHOD [J].
OEHLERT, GW .
AMERICAN STATISTICIAN, 1992, 46 (01) :27-29
[29]   Gene finding with a hidden Markov model of genome structure and evolution [J].
Pedersen, JS ;
Hein, J .
BIOINFORMATICS, 2003, 19 (02) :219-227
[30]   Joint Bayesian estimation of alignment and phylogeny [J].
Redelings, BD ;
Suchard, MA .
SYSTEMATIC BIOLOGY, 2005, 54 (03) :401-418