Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes

被引:276
作者
Yang, ZH
Swanson, WJ
机构
[1] UCL, Galton Lab, Dept Biol, London NW1 2HE, England
[2] Cornell Univ, Dept Mol Biol & Genet, Ithaca, NY 14853 USA
[3] Univ Calif Riverside, Dept Biol, Riverside, CA 92521 USA
关键词
synonymous rate; nonsynonymous rate; positive selection; partitioned data; lysin; MHC; maximum likelihood; Bayes;
D O I
10.1093/oxfordjournals.molbev.a003981
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The nonsynonymous to synonymous substitution rate ratio (omega = d(N)/d(S)) provides a sensitive measure of selective pressure at the protein level, with w values <1, = 1, and >1 indicating purifying selection, neutral evolution, and diversifying selection, respectively. Maximum likelihood models of codon substitution developed recently account for variable selective pressures among amino acid sites by employing a statistical distribution for the W ratio among sites. Those models, called random-sites models, are suitable when we do not know a priori which sites are under what kind of selective pressure. Sometimes prior information (such as the tertiary structure of the protein) might be available to partition sites in the protein into different classes, which are expected to be under different selective pressures. It is then sensible to use such information in the model. In this paper, we implement maximum likelihood models for prepartitioned data sets, which account for the heterogeneity among site partitions by using different W parameters for the partitions. The models, referred to as fixed-sites models, are also useful for combined analysis of multiple genes from the same set of species. We apply the models to data sets of the major histocompatibility complex (MHC) class I alleles from human populations and of the abalone sperm lysin genes. Structural information is used to partition sites in MHC into two classes: those in the antigen recognition site (ARS) and those outside. Positive selection is detected in the ARS by the fixed-sites models. Similarly, sites in lysin are classified into the buried and solvent-exposed classes according to the tertiary structure, and positive selection was detected at the solvent-exposed sites. The random-sites models identified a number of sites under positive selection in each data set, confirming and elaborating the results of the fixed-sites models. The analysis demonstrates the utility of the fixed-sites models, as well as the power of previous random-sites models, which do not use the prior information to partition sites.
引用
收藏
页码:49 / 57
页数:9
相关论文
共 41 条