Evolutionary distance estimation under heterogeneous substitution pattern among lineages

被引:216
作者
Tamura, K
Kumar, S
机构
[1] Tokyo Metropolitan Univ, Dept Sci Biol, Hachioji, Tokyo 1920397, Japan
[2] Arizona State Univ, Dept Biol, Tempe, AZ USA
关键词
substitution rate; mutation rate; base composition; LogDet; computer simulation;
D O I
10.1093/oxfordjournals.molbev.a003995
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most of the sophisticated methods to estimate evolutionary divergence between DNA sequences assume that the two sequences have evolved with the same pattern of nucleotide substitution after their divergence from their most recent common ancestor (homogeneity assumption). If this assumption is violated, the evolutionary distance estimated will be biased, which may result in biased estimates of divergence times and substitution rates, and may lead to erroneous branching patterns in the inferred phylogenies. Here we present a simple modification for existing distance estimation methods to relax the assumption of the substitution pattern homogeneity among lineages when analyzing DNA and protein sequences. Results from computer simulations and empirical data analyses for human and mouse genes are presented to demonstrate that the proposed modification reduces the estimation bias considerably and that the modified method performs much better than the LogDet methods, which do not require the homogeneity assumption in estimating the number of substitutions per site. We also discuss the relationship of the substitution and mutation rate estimates when the substitution pattern is not the same in the lineages leading to the two sequences compared.
引用
收藏
页码:1727 / 1736
页数:10
相关论文
共 27 条
[1]  
BULMER M, 1991, MOL BIOL EVOL, V8, P868
[2]   Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions [J].
Foster, PG ;
Hickey, DA .
JOURNAL OF MOLECULAR EVOLUTION, 1999, 48 (03) :284-290
[3]   INFERRING PHYLOGENIES FROM DNA-SEQUENCES OF UNEQUAL BASE COMPOSITIONS [J].
GALTIER, N ;
GOUY, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (24) :11317-11321
[4]   Bias-corrected paralinear and LogDet distances and tests of molecular clocks and phylogenies under nonstationary nucleotide frequencies [J].
Gu, X ;
Li, WH .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (10) :1375-1383
[5]   DATING OF THE HUMAN APE SPLITTING BY A MOLECULAR CLOCK OF MITOCHONDRIAL-DNA [J].
HASEGAWA, M ;
KISHINO, H ;
YANO, TA .
JOURNAL OF MOLECULAR EVOLUTION, 1985, 22 (02) :160-174
[6]   RIBOSOMAL-RNA TREES MISLEADING [J].
HASEGAWA, M ;
HASHIMOTO, T .
NATURE, 1993, 361 (6407) :23-23
[7]  
JUKES T H, 1969, P21
[8]  
Kumar S, 1996, GENETICS, V143, P537
[9]   Mutation rates in mammalian genomes [J].
Kumar, S ;
Subramanian, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) :803-808
[10]   MEGA2: molecular evolutionary genetics analysis software [J].
Kumar, S ;
Tamura, K ;
Jakobsen, IB ;
Nei, M .
BIOINFORMATICS, 2001, 17 (12) :1244-1245