What can and what cannot be inferred from pairwise sequence comparisons?

被引:17
作者
Baake, E [1 ]
机构
[1] Univ Munich, Inst Zool, D-80333 Munich, Germany
关键词
phylogenetic reconstruction; Markov processes on trees; additive distances; rate heterogeneity; identifiability;
D O I
10.1016/S0025-5564(98)10044-5
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We address questions of identifiability in molecular phylogeny, the art of reconstructing the history of a sample of sequences given just the sequences at the leaves of the phylogenetic tree. Here, the 'history' consists of the tree topology, plus the transition probabilites which define the Markov process of sequence evolution along the branches of the tree. It is assumed that sequences have infinite length, and the pairwise joint distributions of letters at the leaves is taken to be known. We focus on two cases: (1) If the sites of a sequence evolve identically and independently, the topology can be reconstructed, but the one-way edge transition matrices cannot. However, the return-trip transition matrices are reconstructible for every edge, up to conjugacy in the case of internal edges. (2) If a rate factor varies from site to site, different topologies may produce identical pairwise joint distributions, even under the same distribution of rate factors. Consequently, identifiability of the topology is lost on the basis of pairwise sequence comparisons, even if the distribution of rate factors is known. The results are discussed in the context of additive measures of phylogenetic distance. (C) 1998 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:1 / 21
页数:21
相关论文
共 27 条
[1]  
BAAKE E, IN PRESS THEORET POP
[2]   RECONSTRUCTING THE SHAPE OF A TREE FROM OBSERVED DISSIMILARITY DATA [J].
BANDELT, HJ ;
DRESS, A .
ADVANCES IN APPLIED MATHEMATICS, 1986, 7 (03) :309-343
[3]   ASYNCHRONOUS DISTANCE BETWEEN HOMOLOGOUS DNA-SEQUENCES [J].
BARRY, D ;
HARTIGAN, JA .
BIOMETRICS, 1987, 43 (02) :261-276
[5]  
CAVENDER JA, 1978, MATH BIOSCI, V40, P271, DOI 10.1016/0025-5564(78)90089-5
[6]  
CHANG JT, 1991, COMPUTING SCIENCE AND STATISTICS, P254
[7]   Full reconstruction of Markov models on evolutionary trees: Identifiability and consistency [J].
Chang, JT .
MATHEMATICAL BIOSCIENCES, 1996, 137 (01) :51-73
[8]   Inconsistency of evolutionary tree topology reconstruction methods when substitution rates vary across characters [J].
Chang, JT .
MATHEMATICAL BIOSCIENCES, 1996, 134 (02) :189-215
[9]   PROBABILITY MODEL FOR INFERRING EVOLUTIONARY TREES [J].
FARRIS, JS .
SYSTEMATIC ZOOLOGY, 1973, 22 (03) :250-256
[10]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376