The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC

被引:110
作者
Gharib, Walid H. [1 ,2 ]
Robinson-Rechavi, Marc [1 ,2 ]
机构
[1] Univ Lausanne, Dept Ecol & Evolut, Biophore, Lausanne, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
adaptive evolution; codon model; base composition; BIASED GENE CONVERSION; AMINO-ACID SITES; PROTEIN-CODING GENES; PHYLOGENETIC ANALYSIS; EVOLUTIONARY MODELS; SEQUENCE EVOLUTION; LIKELIHOOD METHOD; GENOME; ISOCHORES; INFERENCE;
D O I
10.1093/molbev/mst062
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio omega. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the omega ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to similar to 80%) and low GC (similar to 30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.
引用
收藏
页码:1675 / 1686
页数:12
相关论文
共 44 条
[1]  
Anisimova M, 2003, GENETICS, V164, P1229
[2]   Accuracy and power of Bayes prediction of amino acid sites under positive selection [J].
Anisimova, M ;
Bielawski, JP ;
Yang, ZH .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (06) :950-958
[3]   Accuracy and power of the likelihood ratio test in detecting adaptive molecular evolution [J].
Anisimova, M ;
Bielawski, JP ;
Yang, ZH .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (08) :1585-1592
[4]   Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites [J].
Anisimova, Maria ;
Yang, Ziheng .
MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (05) :1219-1228
[5]   Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models [J].
Anisimova, Maria ;
Kosiol, Carolin .
MOLECULAR BIOLOGY AND EVOLUTION, 2009, 26 (02) :255-271
[6]   Isochores and the evolutionary genomics of vertebrates [J].
Bernardi, G .
GENE, 2000, 241 (01) :3-17
[7]   THE MOSAIC GENOME OF WARM-BLOODED VERTEBRATES [J].
BERNARDI, G ;
OLOFSSON, B ;
FILIPSKI, J ;
ZERIAL, M ;
SALINAS, J ;
CUNY, G ;
MEUNIERROTIVAL, M ;
RODIER, F .
SCIENCE, 1985, 228 (4702) :953-958
[8]  
Cannarozzi GM, 2012, CODON EVOLUTION: MECHANISMS AND MODELS, P1, DOI 10.1093/acprof:osobl/9780199601165.001.0001
[9]   Biased Gene Conversion and the Evolution of Mammalian Genomic Landscapes [J].
Duret, Laurent ;
Galtier, Nicolas .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2009, 10 :285-311
[10]   Efficient Selection of Branch-Specific Models of Sequence Evolution [J].
Dutheil, Julien Y. ;
Galtier, Nicolas ;
Romiguier, Jonathan ;
Douzery, Emmanuel J. P. ;
Ranwez, Vincent ;
Boussau, Bastien .
MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (07) :1861-1874