Performance of likelihood ratio tests of evolutionary hypotheses under inadequate substitution models

被引:48
作者
Zhang, JZ
机构
[1] Penn State Univ, Inst Mol Evolutionary Genet, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
关键词
likelihood ratio test; substitution models; molecular clock; transition transversion bias; rate variation among sites; molecular evolution;
D O I
10.1093/oxfordjournals.molbev.a026171
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In recent years, likelihood ratio tests (LRTs) based on DNA and protein sequence data have been proposed for testing various evolutionary hypotheses. Because conducting an LRT requires an evolutionary model of nucleotide or amino acid substitution, which is almost always unknown, it becomes important to investigate the robustness of LRTs to violations of assumptions of these evolutionary models. Computer simulation was used to examine performance of LRTs of the molecular clock, transition/transversion bias, and among-site rate variation under different substitution models. The results showed that when correct models are used, LRTs perform quite well even when the DNA sequences are as shea as 300 nt. However, LRTs were found to be biased under incorrect models. The extent of bias varies considerably, depending on the hypotheses tested, the substitution models assumed, and the lengths of the sequences used, among other things. A preliminary simulation study also suggests that LRTs based on parametric bootstrapping may be more sensitive to substitution models than are standard LRTs. When an assumed substitution model is grossly wrong and a more realistic model is available, LRTs can often reject the wrong model; thus, the performance of LRTs may be improved by using a more appropriate model. On the other hand, many factors of molecular evolution have not been considered in any substitution models so far built, and the possibility of an influence of this negligence on LRTs is often overlooked. The dependence of LRTs on substitution models calls for caution in interpreting test results and highlights the importance of clarifying the substitution patterns of genes and proteins and building more realistic models.
引用
收藏
页码:868 / 875
页数:8
相关论文
共 32 条
[1]   Synonymous and nonsynonymous substitutions in mammalian genes: Intragenic correlations [J].
Alvarez-Valin, F ;
Jabbari, K ;
Bernardi, G .
JOURNAL OF MOLECULAR EVOLUTION, 1998, 46 (01) :37-44
[2]  
Cunningham CW, 1998, EVOLUTION, V52, P978, DOI 10.1111/j.1558-5646.1998.tb01827.x
[3]   Ribonuclease k6: Chromosomal mapping and divergent rates of evolution within the RNase A gene superfamily [J].
Deming, MS ;
Dyer, KD ;
Bankier, AT ;
Piper, MB ;
Dear, PH ;
Rosenberg, HF .
GENOME RESEARCH, 1998, 8 (06) :599-607
[4]  
Efron B., 1994, INTRO BOOTSTRAP, V57, DOI DOI 10.1201/9780429246593
[5]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[6]  
FELSENSTEIN J, 1984, PHYLIP PHYLOGENY INF
[7]  
FITCH WALTER M., 1967, BIOCHEM GENET, V1, P65, DOI 10.1007/BF00487738
[8]   PERFORMANCE OF LIKELIHOOD RATIO TEST WHEN MODEL IS INCORRECT [J].
FOUTZ, RV ;
SRIVASTAVA, RC .
ANNALS OF STATISTICS, 1977, 5 (06) :1183-1194
[9]   SUCCESS OF MAXIMUM-LIKELIHOOD PHYLOGENY INFERENCE IN THE 4-TAXON CASE [J].
GAUT, BS ;
LEWIS, PO .
MOLECULAR BIOLOGY AND EVOLUTION, 1995, 12 (01) :152-162
[10]  
GAUT BS, 1994, MOL BIOL EVOL, V11, P620