Solvable models of neighbor-dependent substitution processes

Cited by: 9
Authors
Berard, Jean [2, 4]
Gouere, Jean-Baptiste [3]
Piau, Didier [1]
Affiliations
[1] Univ Grenoble 1, UMR 5582, Inst Fourier, F-38402 St Martin Dheres, France
[2] Univ Lyon 1, UMR 5208, Inst Camille Jordan, F-69622 Villeurbanne, France
[3] Univ Orleans, UMR 6628, Lab MAPMO, F-45067 Orleans 2, France
[4] Univ Lyon, F-69003 Lyon, France
Keywords
stochastic models of nucleotide substitution; exactly solvable models; CpG deficiency
DOI
10.1016/j.mbs.2007.10.001
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
We prove that a wide class of Markov models of neighbor-dependent substitution processes on the integer line is solvable. This class contains some models of nucleotidic substitutions recently introduced and studied empirically by molecular biologists. We show that the polynucleotidic frequencies at equilibrium solve some finite-size linear systems. To our knowledge, this provides the first explicit, algebraic formulas for the stationary frequencies of non-degenerate neighbor-dependent models of DNA substitutions. Furthermore, we show that the dynamics of these stochastic processes and their distribution at equilibrium exhibit some stringent, rather unexpected, independence properties. For example, nucleotidic sites at distance at least three evolve independently, and all the sites, when encoded as purines and pyrimidines, evolve independently. (c) 2007 Elsevier Inc. All rights reserved.
Pages: 56-88
Page count: 33