MAXIMUM-LIKELIHOOD TREES FROM DNA-SEQUENCES - A PECULIAR STATISTICAL ESTIMATION PROBLEM

Cited by: 198
Authors
YANG, Z
GOLDMAN, N
FRIDAY, A
Affiliations
[1] NATL INST MED RES, MATH BIOL LAB, LONDON NW7 1AA, ENGLAND
[2] NAT HIST MUSEUM, DEPT ZOOL, LONDON SW7 5BD, ENGLAND
[3] UNIV CAMBRIDGE, DEPT ZOOL, CAMBRIDGE CB2 3EJ, ENGLAND
DOI
10.2307/2413599
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The parameter space of the phylogenetic tree estimation problem consists of three components, T, t, and theta. The tree topology T is a discrete entity that is not a proper statistical parameter but that can nevertheless be estimated using the maximum likelihood criterion. Its role is to specify the branch length parameters and the form of the likelihood function(s). Branch lengths t are conditional on T and are meaningful only for specific values of T. Parameters theta in the model of nucleotide substitution are common to all the tree topologies and represent such values as the transition/transversion rate ratio. T and t thus represent the tree, and theta represents the model. With typical DNA sequence data, differences in T have only a small effect on the likelihood, but changing theta will influence the likelihood greatly. Estimates of theta are also found to be insensitive to T, making it possible to obtain reliable estimates of theta and to perform tests concerning the model (theta) even if knowledge of the evolutionary relationship (T) is not available. In contrast, tests concerning t, such as testing the existence of a molecular clock, appear to be more difficult to perform when the true topology is unknown. In this paper, we explore the peculiarity of the parameter space of the tree estimation problem and suggest methods for overcoming some difficulties involved with tests concerning the model. We also address difficulties concerning hypothesis testing on T, i.e., evaluation of the reliability of the estimated tree topology. We note that estimation of T, and particularly tests concerning T, depend critically on the assumed model.
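To illustrate how a model parameter in theta, such as the transition/transversion rate ratio, enters the likelihood, here is a minimal Python sketch for the simplest possible case: two aligned sequences joined by a single branch, so no topology T is involved. It assumes the Kimura two-parameter (K80) substitution model, which the abstract does not name; the function names and the site counts in the usage note are hypothetical and chosen only for illustration.

```python
import math


def k80_probs(d, kappa):
    """Site probabilities under the K80 model (an assumed model, see lead-in).

    d     -- branch length in expected substitutions per site (plays the role of t)
    kappa -- transition/transversion rate ratio (a component of theta)

    Returns (p_same, p_ts, p_tv): the probability that a nucleotide is
    unchanged, has undergone a transition, or has changed to ONE specific
    transversion target (each nucleotide has two such targets).
    """
    e1 = math.exp(-4.0 * d / (kappa + 2.0))
    e2 = math.exp(-2.0 * d * (kappa + 1.0) / (kappa + 2.0))
    p_same = 0.25 + 0.25 * e1 + 0.5 * e2
    p_ts = 0.25 + 0.25 * e1 - 0.5 * e2
    p_tv = 0.25 - 0.25 * e1
    return p_same, p_ts, p_tv


def pair_log_likelihood(n_sites, n_ts, n_tv, d, kappa):
    """Log-likelihood of a two-sequence alignment summarized by site counts.

    n_sites -- alignment length
    n_ts    -- number of sites differing by a transition
    n_tv    -- number of sites differing by a transversion
    Requires d > 0 so that every site probability is positive.
    """
    p_same, p_ts, p_tv = k80_probs(d, kappa)
    n_same = n_sites - n_ts - n_tv
    # Each site pattern has probability (1/4) * P(change), with equal base
    # frequencies as assumed by K80.
    return (n_same * math.log(0.25 * p_same)
            + n_ts * math.log(0.25 * p_ts)
            + n_tv * math.log(0.25 * p_tv))
```

For example, comparing `pair_log_likelihood(1000, 80, 20, d, kappa)` across several values of `kappa` at a fixed `d` shows how much the log-likelihood can move when only the model parameter changes. Setting `kappa = 1` reduces K80 to the Jukes-Cantor model, which gives a convenient baseline.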
Pages: 384-399
Page count: 16