Variation in evolutionary processes at different codon positions

Cited by: 76
Authors
Bofkin, Lee [1]
Goldman, Nick [1]
Affiliations
[1] European Bioinformatics Institute, European Molecular Biology Laboratory, Hinxton, England
Funding
Wellcome Trust (UK);
Keywords
adaptive evolution; codon positions; phylogenetic inference; protein-coding sequences; sequence evolution;
DOI
10.1093/molbev/msl178
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular biology];
Subject classification codes
071010 ; 081704 ;
Abstract
Evolutionary studies commonly model single nucleotide substitutions and assume that they occur as independent draws from a unique probability distribution across the sequence studied. This assumption is violated for protein-coding sequences, and we consider modeling approaches where codon positions (CPs) are treated as separate categories of sites because within each category the assumption is more reasonable. Such "codon-position" models have been shown to explain the evolution of codon data better than homogeneous models in previous studies. This paper examines the ways in which codon-position models outperform homogeneous models and characterizes the differences in estimates of model parameters across CPs. Using the PANDIT database of multiple-species DNA sequence alignments, we quantify the differences in the evolutionary processes at the 3 CPs in a systematic and comprehensive manner, characterizing previously undescribed features of protein evolution. We relate our findings to the functional constraints imposed by the genetic code, protein function, and the types of mutation that cause synonymous and nonsynonymous codon changes. The results increase our understanding of selective constraints and could be incorporated into phylogenetic analyses or gene-finding techniques in the future. The methods used are extended to an overlapping reading frame data set, and we discover that overlapping reading frames do not necessarily cause more stringent evolutionary constraints.
Pages: 513-521
Page count: 9