A dependent-rates model and an MCMC-based methodology for the maximum-likelihood analysis of sequences with overlapping reading frames

被引:68
作者
Pedersen, AMK [1 ]
Jensen, JL [1 ]
机构
[1] Aarhus Univ, Inst Math, Dept Theoret Stat, DK-8000 Aarhus C, Denmark
关键词
overlapping reading frames; substitution process; dependent substitution rates; MCMC; hepatitis B;
D O I
10.1093/oxfordjournals.molbev.a003859
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a model and methodology for the maximum-likelihood analysis of pairwise alignments of DNA sequences in which two genes are encoded in overlapping reading frames. In the model for the substitution process, the instantaneous rates of substitution are allowed to depend on the nucleotides occupying the sites in a neighborhood of the site subject to substitution at the instant of the substitution. By defining the neighborhood of a site to extend over all sites in the codons in both reading frames to which a site belongs, constraints imposed by the genetic code in both reading frames can be taken into account. Due to the dependency of the instantaneous rates of substitution on the states at neighboring sires, the transition probability between sequences does not factorize and therefore cannot be obtained directly. We present a Markov chain Monte Carlo procedure for obtaining the ratio of two transition probabilities between two sequences under the model considered, and we describe how maximum-likelihood parameter estimation and likelihood ratio tests can be performed using the procedure. We describe how the expected numbers of different types of substitutions in the shared history of two sequences can be calculated, and we use the described model and methodology in an analysis of a pairwise alignment of two hepatitis B sequences in which two genes are encoded in overlapping frames. Finally, we present an extended model, together with a simpler approximate estimation procedure, and use this to test the adequacy of the former model.
引用
收藏
页码:763 / 776
页数:14
相关论文
共 14 条
[1]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[2]  
GANEM D, 1996, FIELDS VIROLOGY, V2, P2703
[3]  
GLKS WR, 1996, MARKOV CHAIN MONTE C, P1
[4]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[5]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174
[6]   A MAXIMUM-LIKELIHOOD APPROACH TO ANALYZING NONOVERLAPPING AND OVERLAPPING READING FRAMES [J].
HEIN, J ;
STOVLBAEK, J .
JOURNAL OF MOLECULAR EVOLUTION, 1995, 40 (02) :181-189
[7]   Probabilistic models of DNA sequence evolution with context dependent rates of substitution [J].
Jensen, JL ;
Pedersen, AMK .
ADVANCES IN APPLIED PROBABILITY, 2000, 32 (02) :499-517
[8]  
LI WH, 1985, MOL BIOL EVOL, V2, P150
[9]  
MUSE SV, 1995, GENETICS, V139, P1429
[10]  
MUSE SV, 1994, MOL BIOL EVOL, V11, P715