An evolutionary model for protein-coding regions with conserved RNA structure

Cited by: 33
Authors
Pedersen, JS [1]
Forsberg, R
Meyer, IM
Hein, J
Affiliations
[1] Aarhus Univ, Bioinformat Res Ctr, Aarhus, Denmark
[2] Univ Oxford, Dept Stat, Genome Anal & Bioinformat Grp, Oxford, England
Keywords
RNA structure; coding region; overlapping information; context-dependent evolution; virus evolution
DOI
10.1093/molbev/msh199
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
Here we present a model of nucleotide substitution in protein-coding regions that also encode the formation of conserved RNA structures. In such regions, apparent evolutionary context dependencies exist, both between nucleotides occupying the same codon and between nucleotides forming a base pair in the RNA structure. The overlap of these fundamental dependencies is sufficient to cause "contagious" context dependencies which cascade across many nucleotide sites. Such large-scale dependencies challenge the use of traditional phylogenetic models in evolutionary inference because they explicitly assume evolutionary independence between short nucleotide tuples. In our model we address this by replacing context dependencies within codons by annotation-specific heterogeneity in the substitution process. Through a general procedure, we fragment the alignment into sets of short nucleotide tuples based on both the protein coding and the structural annotation. These individual tuples are assumed to evolve independently, and the different tuple sets are assigned different annotation-specific substitution models shared between their members. This allows us to build a composite model of the substitution process from components of traditional phylogenetic models. We applied this to a data set of full-genome sequences from the hepatitis C virus where five RNA structures are mapped within the coding region. This allowed us to partition the effects of selection on different structural elements and to test various hypotheses concerning the relation of these effects. Of particular interest, we found evidence of a functional role of loop and bulge regions, as these were shown to evolve according to a different and more constrained selective regime than the nonpairing regions outside the RNA structures. Other potential applications of the model include comparative RNA structure prediction in coding regions and RNA virus phylogenetics.
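The fragmentation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the dot-bracket input format, the `frame_start` parameter, and the category labels are all assumptions made for illustration. Paired columns become dinucleotide tuples keyed by the codon positions of both partners. Unpaired columns become single-nucleotide tuples keyed by codon position and by whether they lie inside a structure (loop or bulge) or outside it.

```python
from collections import defaultdict


def parse_pairs(dot_bracket):
    """Map each paired column index to its partner (hypothetical helper)."""
    stack, partner = [], {}
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure annotation")
            j = stack.pop()
            partner[i], partner[j] = j, i
    if stack:
        raise ValueError("unbalanced structure annotation")
    return partner


def fragment(dot_bracket, frame_start=0):
    """Group alignment columns into annotation-specific tuple sets.

    Assumes a single uninterrupted reading frame beginning at column
    `frame_start` and a pseudoknot-free dot-bracket structure annotation.
    """
    partner = parse_pairs(dot_bracket)
    tuple_sets = defaultdict(list)
    depth = 0  # number of base pairs enclosing the current column
    for i, ch in enumerate(dot_bracket):
        codon_pos = (i - frame_start) % 3 + 1
        if ch == "(":
            j = partner[i]
            partner_pos = (j - frame_start) % 3 + 1
            # Record each pair once, at its opening (5') column.
            tuple_sets[("pair", codon_pos, partner_pos)].append((i, j))
            depth += 1
        elif ch == ")":
            depth -= 1
        else:
            region = "loop" if depth > 0 else "outside"
            tuple_sets[("single", region, codon_pos)].append((i,))
    return dict(tuple_sets)


sets = fragment(".((..)).")
for category, members in sorted(sets.items()):
    print(category, members)
```

Each key would then be assigned its own substitution model shared by all tuples in that set, which is the sense in which the abstract speaks of annotation-specific heterogeneity. How finely to split the categories is a modelling choice that the abstract does not specify.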
Pages: 1913-1922
Number of pages: 10